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INHIBITION OF GENE TRANSCRIPTION BY POLY AMIDE DNA-BINDING LIGANDS 

5 

The U.S. Government has certain rights in this invention pursuant to Grant Nos. GM 
26453, 27681, 47530 and AI 29182 awarded by the National Institute of Health. 

10 CROSS REFERENCE TO RELATED APPLICATIONS 

This application is a continuation-in-part of U.S. Provisional Applications S.N. 
60/038,384, filed Febmaiy 14, 1997, S.N. 60/038,394, filed February 14, 1997, S.N. 60/[CIT 
2683], filed September 2, 1997, and S.N. 60/[CIT 2684], filed September 10, 1997, which are 
incorporated by reference, U.S. application 08/853,022, filed April 21,' 1997, and 
15 PCT/US97/12722, filed July 21, 1997. 

BACKGROUND OF THE INVENTION 

Field of the Invention 

20 This invention relates to polyamides that bind to predetermined sequences in the minor 

groove of double stranded DNA that are useful for diagnosis and treatment of diseases 
associated with gene transcription. This invention is related to modulation of cellular or viral 
gene expression required for maintenance and replication of pathogens in infectious disease, 
such as HIV-1 and CMV. This invention is also related to modulation of cellular gene 

25 expression in non-infectious disease conditions, such as cancers involving oncogenes, e.g., her- 
2/neu. 

Gene Therapy Approaches for HIV: 

30 Considerable effort has been expended over the past decade to devise methods to 

interfere with HIV-1 gene expression in living cells in the hope that therapeutic strategies will 
come from these studies (recently reviewed in Kohn, D.B. and N. Sarver, Gene therapy for 
HIV'I infection, in Antiviral Chemotherapy, J. Mills, P. A. Volberding, and L. Corey, Editors. 
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1996, Plenum Press: New York. p. 421-427.). One approach includes interference with the 
translation of messenger RNA into protein by the introduction of antisense oligonucleotides 
into lymphoid cells, as discussed in Kohn, D.B. and N. Sarver, Gene therapy for HIV-1 
infection, in Antiviral Chemotherapy, J. Mills, P. A. Volberding, and L. Corey, Editors. 1996, 
5 Plenum Press: New York. p. 421-427; Bordier, B,, et al. Proc. Natl Acad. ScL U.S.A., 92: 
9383-9387 (1995) and Lisziewicz, J., et ai , Proc. Natl. Acad. ScL U.S.A., 91: 7942-7946 
(1994). 

Another approach involves ribozyme-mediated destruction of specific regions of HIV-1 
10 mRNA. See Sun, L.Q., et ai, Proc. Natl Acad. Sci. U.S.A., 92: 7272-7276 (1995); Yamada, 
O., et al. , J. Virol, 70: 1596-1601 (1996) and Zhou, C, et al , Gene. 149: 33-39 (1994), 
Decoy molecules, corresponding to HIV-1 RNA domains that bind regulatory proteins required 
for the HIV-1 life cycle (TAR RNA which binds Tat or the Rev-response element) have been 
used as inhibitors of HIV-1 replication (Sullenger, B.A., et al , Cell 63: 601-608 (1990). In 
15 addition, /ra;/5-dominant mutant versions of these regulatory proteins, introduced into cells 
with retroviral expression vectors, have been shown to inhibit HIV-1 replication (Bevec, D., et 
al , Proc. Natl Acad. ScL U S /!., 89: 9870-9874, 1992,), 

Other approaches for direct inhibition of gene transcription, including designed or 
20 selected zinc finger peptides that recognize pre-determined DNA sequences, are described in 
Wu, H., et al., Proc. Natl Acad. Set. U.S.A., 92: 344-348 (1995) and Thiesen, H.-J., Gene 
Expr., 5: 229-243 (1996). DNA-clcaving ribozymes have also been tried (Raillard, S.A. and 
G,F. Joyce, Biochemistry, 35: 11693-11701(1996)). Triple helix-forming oligonucleotides 
have been used to block HIV-1 integration: Bouziane, M., et al., J. Biol Chem., 271: 10359- 
25 10364 (1996). Triple helix-forming oligonucleotides have also been used specifically cleave 
HIV-1 DNA with a metalloporj^hyrin group attached to the oligonucleotide, as described by 
Bigey, P., G. PratvieK and B. Meunier, Nucleic Acids Res., 23: 3894-3900 (1995). 
Additionally, the DNA-binding calicheamicin oligosaccharides have the potential for use in 
anti-HIV-1 therapy but have not as yet been applied to this disease. See Ho, S.N., et al, Proc. 
30 Natl Acad. ScL, 91: 9203-9307 (1994) and Liu, C, et al. Biochemistry, 93: 940-944 (1996). 

For any gene therapy approach to be successful, several criteria must be met by the 
therapeutic agent: First, the agent must not possess any general cell toxicity and should not 
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elicit an immune response. Second, the agent must be cell-permeable or amenable to viral 
delivery and, in the case of the DNA -binding agents, the therapeutic agent must transit to the 
nucleus and bind the target sequence with high affinity and specificity in the context of cellular 
chromatin. Third, binding of the agent to its DNA or RNA target sequence must interfere with 
gene transcription or protein translation. 

Each of the potential approaches listed above has its own unique advantages and 
limitations. For example, while nucleic acid-based approaches (antisense, decoy and triple 
helix-forming oligonucleotides and ribozymes) have the potential for sequence selectivity and 
can effectively inhibit transcription or translation in vitro, these molecules suffer from poor cell 
permeability and other delivery systems, such as retroviral vectors in the case of the ribozjmies 
(Zhou, C, et al., 1994) or liposomes or other delivery strategies in the case of antisense or 
triple helix oligonucleotides, must be used for effective gene inhibition (reviewed in Kohn & 
Sarver, 1996). Similarly, zinc finger peptides must be introduced via a gene therapy approach 
with an appropriate viral expression vector since these peptides cannot directly enter cells. See 
Choo, Y,, etaL, Nature, 372: 642-645 (1994). 

One additional problem with gene therapy approaches is that they must be performed 
on lymphoid cells ex vivo and, once an "HTV-protected" cell population is established, these 
cells must then be introduced into the patient. 

In contrast to gene therapy approaches, HIV protease inhibitors taken in combination 
with standard anti -retroviral agents (AZT) have recently shown success in clinical trials. Wei, 
X. et al, Nature, 373: 1 17-122 (1995); Ho, D.D. et aL, Nature, 373: 123-126 (1995). 

The key to the anti-HlV properties of these drugs is that they strike at two separate 
phases of the virus life cycle, limiting the ability of spontaneous mutations to result in 
inhibitor-resistant strains of the virus. Small molecule inhibitors of HIV-1 RNA transcription 
which would target a third phase of the virus life cycle would be highly desirable. Cell- 
permeable sequence-specific DNA-binding ligands would circumvent the problems associated 
with other forms of gene therapy and could compliment the protease inhibitor-anti-retroviral 
agent cocktail approach mentioned above. The calicheamicin oligosaccharides satisfy some of 
the requirements for a therapeutic agent; these molecules are sufficiently hydrophobic to pass 

<WO_9e3S702A1J,> 
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through cell membranes but these molecules possess severely limited sequence specificity (4 
base pairs) and bind DNA with very low affinities (100 (iM or higher required for inhibition of 
protein-DNA interactions . See Ho, S.N., et ai, Proa Natl. Acad. ScL. 91: 9203-9307 (1994) 
and Liu, C, et ai. Biochemistry, 93: 940-944 (1996)). 

5 

Thus, new classes of ccU-permeable molecules that possess higher degrees of DNA 
sequence specificity and affinity are needed for the treatment of AIDS and other infectious 
diseases. We describe below the successful development of a new class of highly specific 
designed small molecule ligands with great potential for inhibition of HIV-1 gene transcription, 

10 

The HIV-1 Enhancer and Promoter: 

A recent l eview has summarized our current knowledge of the protein factors required 
for the control of RNA initiation and elongation by RNA polymerase II at the HIV-1 promoter 

15 (Jones, K.A. and B.M. Peterlin. 1994, Control of RNA initiation and elongation at the HIV-1 
promoter. Annti. Rev, Biochenu, 63: 717-743). Thus only those aspects of HIV-1 transcription 
that relate to transcription inhibition are discussed herein. For HIV, the template for synthesis 
of both new viral RNA and messenger RNA (for viral protein synthesis) is the integrated 
pro virus, the product of reverse transcription of the viral RNA in the infected cell. HIV-1 

20 utilizes the transcription machinery of the host cell but encodes its own /ra«5-activators, Tat 
and Rev, that are responsible for RNA elongation and utilization. The HIV-1 promoter is 
located in the U3 region of the leftward (5') long terminal repeat (see Figure 1 1 below, taken 
from Jones and Peterlin, 1994). The core promoter and enhancer elements span a region of 
approximately 250 base pairs and include TATA and initiator elements and the binding sites 

25 for the following cellular transcription factors: Spl, NF-kB, LEF-l, Ets-1 and USF. 
Sequences upstream of the NF-kB sites contribute only marginally to HIV-1 promoter activity 
either in vitro or in transfcctcd cell lymphoid cell lines. Waterman, M.L. and K.A. Jones, New 
Biologist, 2: 621-636 (1990). Flowever, these upstream sequences, and presumably the protein 
factors which bind these upstream sequences, are important for viral replication, and hence 

30 promoter activity, in peripheral blood lymphocytes and in some T cell lines. Kim, J., et 
ai. J. ViroL 67: 1658-1662 (1993). 
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Two of tlic binding sites in the upstream region correspond to recognition sites for 
activator proteins that arc lymphoid cell specific (or highly enriched in T cells) and are shared 
with the promoter of the T cell receptor (TCRa) gene; these are the Ets-1 and LEF-1 
transcription factors. The essential role of the upstream region has recently been reproduced in 
5 vitro with a chromatin rcconstitution assay (Sheridan, P.L., et ciL, Genes Dev., 9: 2090-2104 
(1995)), 

Packaging of the HIV-1 promoter into nucleosomes strongly repressed transcription, 
but this repression couid be relieved by pre-incubation of the template with the HIV-1 

10 enhancer-binding proteins, LEF-1 and Ets-1. LEF-1 and Ets-I thus apparently act in concert to 
prevent nucleosome-mediated repression in vivo. Inhibition of formation of this complex by 
small molecule inhibitors could well represent a viable target for HIV-1 gene therapy. LEF-1 
is a member of the HMG family of proteins and binds DNA as a monomer. DNA binding is in 
the minor groove and results in a large distortion of the DNA hehx (unwinding and bending) 

15 (Love, J J., etal.. Nature, 376: 791-795 (1995)). 

In addition to acting as an architectural transcription factor, LEF-1 possesses a strong 
/ra/75-activation domain which can function when artificiality transferred to other DNA- 
binding proteins (Giese, K., et cii. Genes Dev., 9: 995-1008 (1995)). This region of the HIV-1 
20 enhancer might thus prove to be an effective target for inhibition of viral transcription and 
hence virus replication. 

The HIV-1 promoter also contains tandem binding sites for NF-kB, a factor that is 
strongly induced by HIV infection (Bachelerie, F., et al Nature, 350: 709-712 (1991)) and 

25 multiple binding sites for the general transcription factor Spl. The mechanisms of NF-kB 
activation have been reviewed by Jones and Peterlin, 1994. Important for this discussion, NF- 
kB has been showii to contact both Spl and the TBP subunit of the basal transcription factor 
TFIID. Perkins, N.D., el ai, Mol. Cell Bioi, 14: 6570-6583 (1994). Additionally, Spl has 
been shown to interact with the TAFIIO subunit of TFIID (llOkDa TBP-associated factor) 

30 (Chen, J.L., et al, Cell 79: 93-105, 1994), It is the binding of TFIID via the TBP interaction 
with the TATA element that nucleates the assembly of the complete RNA polymerase II 
transcription complex (reviewed in Maldonado, E. and D. Reinberg, Current Opinion in Cell 
Biology, 7: 352-361, 1995). Thus, NF-kB may function through recruitment of Spl and TFIID 
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to the HIV-1 promotor via these protein-protein interactions. Thus blocking the NF-kB-DNA 
and/or Spl-DNA interaction is another potential target for HIV therapy. Since these factors, 
and especially Spl and TFIID, are utilized by a wide range of cellular genes, the binding sites 
for these factors would not be good targets for HlV-specific inhibition (or any gene-specific 
5 inhibition). However, the sequences adjacent to these sites, that are unique to HIV-1 proviral 
DNA, are excellent candidate targets for the design of inhibitory DNA ligands (see below). 

Polyamide DNA-biuding Ligands: 

Simple rules have been developed to rationally determine the sequence-specificity of 

10 minor-groove-binding polyamides containing A^-methylimidazole and A^-methylpyrrole amino 
acids. These synthetic pyrrole-imidazole polyamides bind DNA with excellent specificity and 
very high affinities, even exceeding the affinities of many sequence-specific transcription 
factors (Trauger ei aL,, Nature, 382: 559-561, 1996). An Im/Py pair distinguishes G^C firom 
C*G and both of these from A«T or 1^A base pairs. Wade, W.S., et al. describes the design of 

15 peptides that bind in the minor groove of DNA at 5'-WGWCW-3' sequences (where W is 
either A or T, and a W«W pairs is an A«T or a T^A base pairs by a dimeric side-by-side motif 
J. Am. Chem. Sac. 114, 8783-8794 (1992); Mrksich, M. et al. describes antiparallel side-by- 
side motif for sequence specific-recognition in the minor groove of DNA by the designed 
peptide l-methyliniidazole-2-carboxamidenetropsin. Proc. Natl Acad. Sci. USA 89, 7586- 

20 7590 (1992); Trauger. J.W., et al., describes the recognition of DNA by designed ligands at 
subnanomolar concentrations. Nature 382, 559-561 (1996). A Py/Py pair specifies A«T from 
G*C but does not distinguish A»T from T*A. Pelton, J.G. & Wemmer, D.E. describes the 
structural characterization of a 2-1 distamycin A-d(CGCAAATTTGGC) complex by two- 
dimensional NMR. Proc. Natl. Acad, ScL USA 86, 5723-5727 (1989); White, S., et al. 

25 Biochemistry 35, 12532-12537 (1996) describes the effects of the A»T/T*A degeneracy of 
pyrrole-imidazole polyamide recognition in the minor groove of DNA, the pairing mles for 
recognition in the minor groove of DNA by pyrrole-imidazole polyamides, and also describes 
the 5 '-3' N-C orienuuion preference for polyamide binding in the minor groove. 

30 It has been found that a new aromatic amino acid, 3-hydroxy-N-methylpyrrole (Hp) 

when incorporated into a polyamide and paired opposite Py, provides the means to discriminate 
A*T from ^^A. White, S., et a/.. Nature 391 436-438 (1998). Unexpectedly, the replacement 
of a single hydrogen aloni on the pyrrole with a hydroxy group in a Hp/Py pair regulates the 
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affinity and the specificity of a polyamide by an order of magnitude. Utilizing Hp together 
with Py and Im in polyamides to form four aromatic amino acid pairs (Im/Py, Py/Im, Hp/Py, 
and Py/Hp) provides a code to distinguish all four Watson-Crick base pairs in the minor groove 
ofDNA. 

The preferred corresponding designed specific polyamides resulting from this invention 
are of the form 

X1X2 . . . Xm-Y-X(ni -f 1) . . , X(2m.l)X2m-P-Dp 

wherein X], X2, Xm, X(ni + 1), X(2m ~ 1)> and X2m are carboxamide residues 
forming carboxamide binding pairs Xi/X2m, X2/X(2m-1). Xm/X(rri + 1), and y is y- 
aminobuytic acid or 2A diaminobutyric acid and Dp is dimethylaminopropylamide, 
and where 

carboxamide binding pair Xi/X2m corresponds to base pair Ni*N'i, 
carboxamide binding pair X2/X(2m-1) corresponds to base pair N2*N'2, 
carboxamide binding pair Xn-i/X(in4-l) corresponds to base pair Nm*N'rn- 

In general, the specific polyamide DNA-binding Ugands were designed by using a 
method that comprises the steps of identifying the target DNA sequence 5'-WNiN2 . . . 
NmW-3*; representing the identified sequence as 5'-Wa6 . . . jcW-3', wherein a is a first 
nucleotide to be bound by the X] carboxamide residue, 6 is a second nucleotide to be bound by 
the X2 carboxamide residue, and .v is the corresponding nucleotide to be bound by the Xm 
carboxamide residue; defining a as A, G, C, or T to correspond to the first nucleotide to be 
bound by a carboxamide residue in the identified six base pair sequence, 

Carboxamide residues were selected sequentially as follows: Im was selected as the Xi 
carboxamide residue and Py as the X2m carboxamide residue if a was G. Py was selected as 
the Xi carboxamide residue and Im as the X2m carboxamide residue if a was C. Hp was 
selected as the X] carboxamide residue and Py as the X2m carboxamide residue if a was T. 
Py was selected as the X] carboxamide residue and Hp as the X2m carboxamide residue if a 
was A. 
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The remaining carboxamide residues were selected in the same fashion. Im was 
selected as the X2 carboxamide residue and Py as the X2m-1 carboxamide residue if b was G. 
Py was selected as the X2 carboxamide residue and Im as the X2m-1 carboxamide residue if b 
5 was C. Hp was selected as the X2 carboxamide residue and Py as the X2m-1 carboxamide 
residue if h was T. Py was selected as the X2 carboxamide residue and Hp as the X2m-1 
carboxamide residue if h was A. 

The selection of carboxamide residues was continued through m iterations. In the last 
10 iteration, Im was selected as the X^w carboxamide residue and Py as the Xm+1 carboxamide 
residue if x was G. Py was selected as the Xni carboxamide residue and Im as the Xm+1 
carboxamide residue if x was C. Hp was selected as the Xm carboxamide residue and Py as 
the Xm-hl carboxamide residue if a* was T. Py was selected as the Xm carboxamide residue 
and Hp as the Xm-hl carboxamide residue if a: was A. 

15 

In one preferred embodiment, the poiyamide includes at least four consecutive 
carboxamide pairs for binding to at least four base pairs in a duplex DNA sequence. In another 
preferred embodiment, the poiyamide includes at least five consecutive carboxamide pairs for 
binding to at least five base pairs in a duplex DNA sequence. In yet another preferred 
20 embodiment, the poiyamide includes at least six consecutive carboxamide pairs for binding to 
at least six base pairs in a duplex DNA sequence. In one preferred embodiment, the improved 
polyamides have four carboxamide binding pairs that will distinguish A^T, T*A, C»G and G*C 
base pairs in the minor groove of a duplex DNA sequence. 

25 DNA target sequence recognition thus depends on a code of side-by-carboxamide 

residue pairings in the minor groove of double stranded DNA. These compounds represent the 
only class of syntlietic small molecules that can bind predetermined DNA sequences with 
affinities and specitlcitics comparable to DNA-binding proteins. 

30 
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Summary Of The Invention 

This invention provides specific polyamides that are useful for modulating the 
expression of a cellular or viral gene by binding to predetermined target sequences adjacent to 
5 the binding site of a transcription factor protein in the minor groove of double stranded DNA. 
Suitable cellular genes include both eukaryotic and prokaryotic genes. The cellular gene can 
be present in the original native cells, in cells transfected or transformed with a recombinant 
DNA construct comprising the cellular gene or in an in vitro in a cell-free system. The viral 
gene can be present in a cell or in an //; vitro in a cell-free system. 

10 

The polyamides of the present invention can act as specific inhibitors of gene 
transcription in vivo or in vitro as therapeutic agents in disease conditions related to the 
transcription of at least one cellular or viral gene. Studies with three accepted model systems 
have shown that polyamides do interfere with the binding of sequence-specific minor groove 
15 transcription factor proteins as well as with components of the basal transcription machinery 
and thus block transcription of target genes. 

Hereinafter, N-methylpyrrole carboxamide may be referred to as "Py", N- 
methylimidazole carboxamide may be referred to as "Im", 3-hydroxy-N-methylpyrrole 
20 carboxamide may be referred to as "Hp", y-aminobutyric acid may referred to as "y"* P-alanine 
may be referred to as ''P", glycine may be referred to as "G", dimethylaminopropylamide may 
be referred to as ''Dp'\ and cthylenediaminetetraacetic acid may be referred to as "EDTA". 

The invention encompasses polyamides having y-aminobutyric acid or a substituted y- 
25 aminobutyric acid to fon^n a hairpin with a member of each carboxamide pairing on each side 
of it. Preferably the substituted y-aminobutyric acid is a chiral substituted y-aminobutyric acid 
such as (R)-2,4-diaminobutyric acid. In addition, the polyamides may contain an aliphatic 
amino acid residue, preferably a P-alanine residue, in place of a Hp or Py carboxamide. The p- 
alanine residue is represented in formulas as p. The P-alanine residue becomes a member of a 
30 carboxamide binding pair. The invention further includes the substitution as a p/p binding pair 
for non-Im containing binding pair. Thus, binding pairs in addition to the Im/Py, Py/Im, Hp/Py 
and Py/Hp are Im/[^, p/Im, Py/p, p/Py, Hp/p, p/Hp, and p/p. 
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The polyaniidcs of the invention can have additional moieties attached covalently to the 
polyamide. Preferably the additional moieties are attached as substituents at the amino 
terminus of the polyamide, the carboxy terminus of the polyamide, or at a chiral (R)-2,4- 
diaminobutyric acid residue. Suitable additional moieties include a detectable labeling group 
5 such as a dye, biotin or a hapten. Other suitable additional moieties are DNA reactive moieties 
that provide for sequence specific cleavage of the duplex DNA. 

A central aspect of the present invention is the use of unique or rare sequences adjacent 
to the binding sites for common transcription factors as the target sequences for the design of 

10 polyamides. It has been found that (1) sequences adjacent to the binding sites for required 
transcription factors are vmique to the genes under study and are not found in other genes in the 
current nucleic acid data bases; (2) polyamides targeted to these sequences are effective 
inhibitors of protein-DNA interactions; (3) such polyamides are inhibitors of transcription 
factor-dependent gene transcription in vitro; and (4) the polyamides are cell permeable agents 

15 and have been shown to inhibit transcription of target genes in human cells in culture. 

Most importantly, several designed polyamides have been shown to inhibit 
transcription of specific genes /'// vivo and thus these compounds must be both cell permeable 
and once inside the cell they must be able to transit the nuclear envelope and bind their target 
20 sites within chromatin (Gottesfeld, J.M., et al, Nature, 387: 202-205, 1997). These results 
demonstrate that designed pyrrole-imidazole polyamides are useful in the treatment of diseases, 
particularly viral diseases, including AIDS, as well as many other diseases for which specific 
candidate gene targets have been identified. 

25 The present invention provides specific polyamides which inhibit the transcription of 

DNA upstream or downstream of transcriptional factors such as the 5S RNA gene 
transcriptional factor TFIIIA, the minor groove-binding protein TATA-box binding protein 
(TBP), Ets-1 and the lymphoid enhancer factor LEF-1 protein. These polyamides act as gene- 
specific inhibitors of transcription since these polyamides are selective for the sequences 

30 flanking these protein binding sites that are, in turn, gene-specific. The polyamides are useful 
as therapeutics for the treatment of cancer as well as for the treatment of diseases caused by 
viruses and other pathogens (such as bacterial, fungal, etc) 
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The present invention provides a composition comprising a transcription inhibiting 
amount of at least one polyamidc chosen from the group consisting of ImPyPyPy-y-ImPyPy-P- 
Dp, ImPy-p-ImPy-y-IirLPy-f3-hiiPy-p-Dp and mixtures thereof and a pharmaceutically 
acceptable excipient suitable for the treatment of HIV-l infection. The invention also provides 
a method of treating a human patient with an HIV-l infection comprising the step of 
administering a composition comprising a transcription inhibiting amount of at least one 
polyamide chosen from the group consisting of ImPyPyPy-y-ImPyPy-p-Dp, ImPy-P-ImPy-y- 
ImPy-p-ImPy-P-Dj-) and mixtures thereof and a pharmaceutically acceptable excipient. 
Preferably a transcription inhibiting amount corresponds to an extracellular concentration of 
polyamide of about 100 nanomolar to about 10 micromolar. In one preferred embodiment, a 
transcription inliibiting amount corresponds to an extracellular concentration of about one 
micromolar to about ten micromolar ImPyPyPy-y-ImPyPy-P-Dp mixed with about one 
micromolar to about ten micromolar ImPy-p-ImPy-y-ImPy-P-ImPy-P-Dp. 

The present invention provides methods of treating cells in vitro as well as treating a 
human patient or a jion-human organism in vivo. In one preferred embodiment, the invention 
provides a method of treating HIV-l infected human blood cells in vitro comprising the step of 
administering a composition comprising a transcription inhibiting amount of at least one 
polyamide chosen iVom the group consisting of ImPyPyPy-y-ImPyPy-p-Dp, ImPy-p-ImPy-y- 
ImPy-P-ImPy-p-Dp and mixtures thereof. 

In other embodiments, the invention provides a diagnostic kit for detecting the 
identified target DNA sequence by employing the selective polyamides and a suitable system 
for the detection of the polyamide bound to the DNA. 

Brief Description of the Drawinas 

Figure 1 is a rcpresentaiion of a schematic model for the interaction of the nine zinc 
finger protein TFIllA with the 5S ribosomal RNA gene internal control region (ICR). 

Figure 2 is a representation of the results of DNase I footprinting analysis of the 
binding of polyamidc 1 and TFIIIA to the 5S RNA gene ICR. 
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Figure 3 is a representation of the results of experiments demonstrating inhibition of 5S 
RNA gene transcription in vitro. 

Figure 4 is a representation of the results of an experiment demonstrating the inhibition 
5 of 5S RNA gene transcription complex formation in vivo. 

Figure 5 is a schematic representation of the HIV-1 genome that encodes Gag, Pol, 
Env and six regulatory proteins as well as anticipated formally matched complexes of 1 and 4, 
and formally mismatched complexes of 2, 3, 5, and 6 with the HIV-1 target sequence 5'- 
10 TGCTGCA-3\ 

Figure 6 is a schematic representation of the structures of 5-y-5 polyamides 1-3, and 2- 
P_2-Y-2-p-2 polyamides 4-6. 

15 Figure 7 is a schematic representation of part of the sequence of the restriction fragment 

used in quantitative DNase I footprinting experiments. The three 7 base pair target sites 5'- 
TGCTGCA-3' (the HIV-l target site), 5^-TGGTGGA-3\ and 5'-TGTTACA-3^ are 
highlighted. 

20 Figure 8 is a representation of the results of a footprint experiment demonstrating the 

binding of polyamides 4-6 and 3. 

Figure 9 is a representation of the results of a MPE^Fe^^ footprint experiment using 
polyamides 4, 5, 6, and 3 at 1 nM. 

25 Figure 10 is a representation of the results of a MPE-Fe^'^ footprint experiment showing 

binding of a seven base pair sequence by polyamides 4, 5, 6, and 3. 

Figure 1 1 is a representation of the sequence of the HIV-1 enhancer / promoter element 
showing the binding sites for the cellular transcription factors Ets-1, LEF-1, NF-icB and SPl 
30 along with a canonical TBP binding site, or "TATA element/' 
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Figure 12 is a representation of the results of an experiment demonstrating inhibition of 
LEF-1 binding to the HIV-1 enhancer by polyamide 1. 

Figure 13 is a representation of the results of an experiment demonstrating binding to 
wild type and mutant forms of the HIV-1 enliancer by polyamide I. 

Figure 14 is a representation of the results of an DNase footprint experiment 
demonstrating binding to the HIV-1 enhancer of the polyamide ImPy-P-ImPy-y-ImPy-p-ImPy- 
p-Dp, also called 

Figure 15 is a representation of the inliibition of HIV-1 transcription by the polyamide 
ImPy-p-ImPy-y-lmPy-p-lmPy-P-Dp, also called HIV-1. 

Figure 16 is a schematic model of the binding of the polyamides ImPy-p-ImPy-Y-ImPy- 
P-ImPy-p-Dp, also called lIIV-1, and Imlm-p-Imlm-y-PyPy-p-PyPy-p-Dp, also called HIV'2 
to the DNA sequences adjacent to the HlV-1 TATA box. 

Figure 17 is a schematic model of the binding of polyamides designed for the 
recognition of TFIID (la - Ic) and Ets-l/LEF-1 (2a-2b) binding sites. 

Figure 18 is a representation of the HIV-1 promoter and DNA-binding sites for 
polyamide hairpins designed to target the HIV-1 promoter. 

Figure 19 is a representation of the results of a footprint experiment comparing the 
binding of polyamides and TBP. 

Figure 20 is a representation of the results of a footprint experiment comparing the 
binding of polyamides and LEF-1. 

Figure 21 is a representation of the results of a footprint experiment comparing the 
binding of polyamides, HIV-1 and CMV. 
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Figure 22 is a graphical representation of the resuhs of a experimental treatment of 
human white blood cells infected with HIV. 

Figure 23 is a graphical representation of the results of a experimental treatment of 
human white blood cells infected with HIV. 

Figure 24 is a graphical representation of the results of a experiment demonstrating the 
specificity of polyamides. 

Figure 25 is a schematic representation of polyamide binding to the DNA sequence 
adjacent to the Els-1 site. 

Figure 26 is a representation of the results of a footprint experiment comparing the 
binding of polyamides and AN331 to the HIV-1 promoter region. 

Figure 27 is a representation of the results of experiments showing the specific Ets-1 
binding inhibition produced by polyamides. 

Figure 2S is a representation of the results of a footprint experiment comparing the 
binding of polyamides and AN33 1 to the HIV-1 promoter region. 

Figure 29 is a schematic model of the binding of polyamides designed for the 
recognition of DNA sequence adjacent to the CMV TBP binding site and the repressor binding 
site. 

Figure 30 is a representation of the results of a footprint experiment comparing the 
binding of polyamides 1 and 5 and IE86 to the repressor region of the CMV genome. 

Figure 31 is a representation of the results of a footprint experiment comparing the 
binding of polyamides 3 and 4 and IE86 to the repressor region of the CMV genome. 

Figure 32 is a representation of the results of an experiment comparing the binding of 
polyamide I and IE86 on the transcription of CMV RNA. 
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Detailed Description of Preferred Embodiments 

Pyrrole-imiclazole polyamides represent the only class of small molecules to date that 
5 can bind any predetcrn^ined DNA sequence. DNA recognition depends on side-by side amino 
acid pairings in the minor groove. A pairing of imidazole (Im) opposite pyrrole (Py) targets a 
G-C base pair. A Py/Py combination is degenerate and targets both T-A and A-T base pairs. 
However, Hp/Py is specific for a T-A base pair and Py/Hp is specific for an A-T base pair. 
The generality of these pairing rules has been demonstrated by targeting a variety of sequences 
10 5-13 base pairs in size and is supported directly by NMR structural studies. Eight-ring hairpin 
polyamides have affinities and specificities comparable to DNA-binding transcription factors. 

Solid phase methods for synthesizing the polyamides of the present invention have been 
described by Baird & Dervan (./ Am, Cliem. Soc, 118, 6141 (1996)). Alternatively, the 
15 polyamides can also be synthesized via solution phase methods as described by Weiss et al. {J, 
Am, Chenu 5oc., 79, 1266 (1957)). 

Many protein coding genes utilize both gene- and tissue- specific transcription factors as 
well as general transcripfion factors for transcription of mRNA by RNA polymerase II. The 

20 binding sites for these protein factors are found in numerous genes, whereas the sequences 
adjacent to these binding sites tend to be unique for each gene. Polyamide ligands can be 
designed which target both the sequences adjacent to the binding sites for these transcription 
factors as well as to the binding sequences for these factors. Polyamides that target these 
sequences will interfere with the binding of the protein factors to DNA and thereby inhibit 

25 transcription by RNA Polymerase II. 

For example, the role of the nine zinc finger protein TFIIIA in the transcriptional 
regulation of the 5S RNA gene by RNA polymerase III has been extensively characterized. 
Zinc fingers 1-3, 5 and 7-9 bind the internal control region (ICR) of the gene through base- 
30 specific interactions in the major groove. Fingers 4 and 6 are essential for high affinity DNA 
binding and have been proposed to bind in or across the minor groove (Figure 1). Well 
estabhshed methods exist for assessing in living cells the status of RNA polymerase III 
transcription complexes on the genes encoding the small 5S ribosomal RNA, 
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In another embodiment, polyamides have been designed and synthesized that recognize 
and bind the sequences immediately adjacent to the site at which the minor groove-binding 
protein TATA-box binding protein (TBP) binds to TATA DNA can be designed. DNA 
5 sequences adjacent lo the TATA elements are gene-specific whereas TATA elements are found 
in many protein-codmg genes. For example, a polyamide bound to the sequence adjacent to 
the HIV- 1 TATA element has been shown to inhibit HIV-1 promoter-specific transcription by 
RNA polymerase II. A polyamide designed to selectively bind this site would be useful for 
treating diseases associated with HI V-1 infection. 

10 

In a third embodiment, a polyamide recognizes and binds the sequence immediately 
adjacent to the binding site of lymphoid enhancer factor LEF-1 in.the HIV-l enhancer. This 
polyamide inhibits the binding to LEF-1 protein to HIV-1 DNA. As above, a polyamide 
designed to selectively bind this site would be useful for treating diseases associated with HIV- 
15 1 infection. 

In another embodiment, a polyamide recognizes and binds to an identifed target 
sequence adjacent lo the transcription factor protein binding site of a cellular gene. In one 
preferred embodiment, the cellular gene is a constitutively expressed gene under basal 
20 transcription control. An preferred cellular gene under basal transcription control is the gene 
encoding the 5S ribosomal subunit. 

In yet another preferred embodiment, the minor groove transcription factor protein of 
the cellular gene is fBP. Such preferred cellular genes include oncogenes such as LEF-1, Ets- 
25 1 and her-2/neu. Other such preferred cellular genes include genes encoding cytokines such as 
interleukins, including IL-2, IL-5 and lL-13, tumor necrosis factors, including TNF-alpha and 
TNF-beta, growth factors, including TGF-beta, and colony stimulating factors, including GM- 
CSF. 

30 Using the above described rules, a sequence-specific polyamide can be designed that 

selectively binds to an identified target site adjacent to the binding site of a minor groove 
transcription factor protein. As used herein, "adjacent'' includes I) polyamide binding sites 
wherein an end nucleotide base pan' of the polyamide binding site is immediately continguous 
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to an end nucleotide of tlic minor groove transcription factor protein binding site, 2) polyamide 
binding sites wherein one to five nucleotide base pairs of the polyamide binding site are shared 
with the binding sites of the mii^or groove transcription factor protein, and 3) polyamide 
binding sites wherein the polyamide binding site is separated from the minor groove 
5 transcription factor protein binding site by from one to four intervening nucleotide base pairs. 
The binding affinity of such a designed polyamide should be greater than the binding affinity 
of the native transcriptional element in order to inhibit transcription. The binding affinity can 
be ascertained by competitive inhibition experiments against a native transcription factor. 

10 Within this application, unless otherwise stated, definitions of the terms and illustration 

of the techniques of this application may be found in any of several well-known references 
such as: Sambrook, J., cl aL, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratoiy Press (1989); Goeddel, D., ed., Gene Expression Technology, Methods in 
Enzymology, 185, Academic Press, San Diego, CA (1991); "Guide to Protein Purification" in 

15 Deutshcer, M.P., etl.. Methods in Enzymology, Academic Press, San Diego, CA (1989); Innis, 
et al., PCR Protocols: A Guide to Methods and Applications, Academic Press, San Diego, CA 
(1990); Freshney, R.I., Culture of Animal Cells: A Manual of Basic Technique, 2nd Ed., Alan 
Liss, Inc. New York, NY (1987); Murray, EJ., ed., Gene Transfer and Expression Protocols, 
pp. 109-128, The Humana Press Inc., Clifton, NJ and Lewin, B., Genes VI, Oxford University 

20 Press, New York ( 1 997). 

For the purposes of this application, a promoter is a regulatory sequence of DNA that is 
involved in the binding of RNA polymerase to initiate transcription of a gene. A gene is a 
segment of DNA involved in producing a peptide, polypeptide or protein, including the coding 

25 region, non-coding regions preceding ("leader') and following ("trailer") the coding region, as 
well as intervening non-coding sequences ("introns") between individual coding segments 
("exons'"). Coding refers to the representation of amino acids, start and stop signals in a three 
base "triplet" code. Promoters arc often upstream (" 5' to") the transcription initiation site of 
the corresponding gene. Other regulatory sequences of DNA in addition to promoters are 

30 known, including sequences involved with the binding of transcription factors, including 
response elements that are the DNA sequences bound by inducible factors. Enhancers comprise 
yet another group of regulatory sequences of DNA that can increase the utilization of 
promoters, and can function m eiliicr orientation (5'-3' or 3'-5') and in any location (upstream 
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or downstream) rclaiive lo llic promoter. Preferably, the regulatory sequence has a positive 
activity, i.e., binding of an cndogcneous ligand (e.g. a transcription factor) to the regulatory 
sequence increases transcription, thereby resulting in increased expression of the corresponding 
target gene, hi such a case, interference with transcription by binding a polyamide to a 
5 regulatory sequence would reduce or abolish expression of a gene. 

The promoter may also inckide or be adjacent to a regulatory sequence known in the art 
as a silencer. A silencer sequence generally has a negative regulatory effect on expression of 
the gene, hi such a case, expression of a gene may be increased directly by using a polyamide 
10 to prevent binding of a factor to a silencer regulatory sequence or indirectly, by using a 
polyamide to block transcription of a factor to a silencer regulatory sequence. 

It is to be understood that the polyamides of this invention bind to double stranded 
DNA in a sequence specific manner. The function of a segment of DNA of a given sequence, 

15 such as 5'-TATAAA-3\ depends on its position relative to other functional regions in the 
DNA sequence. In this case, if the sequence 5'-TATAAA-3' on the sense strand of DNA is 
positioned about 30 base pairs upsueam of the transcription start site, the sequence forms part 
of the promoter region (Lewin, Genes VI, pp. 831-835). On the other hand, if the sequence 5'- 
TATAAA-3* is downstream of the transcription start site in a coding region and in proper 

20 register with the reading frame, the sequence encodes the tyrosyl and lysyl amino acid residues 
(Lewin, Genes VI, pp. 213-215). 

While not being held to one hypothesis, it is believed that the binding of the polyamides 
of this invention modulate gene expression by altering the binding of DNA binding proteins, 
25 such as RNA polymerase, transcription factors, TBP, TFIIIA, the lymphoid enhancer factor 
protein LEF-1, and other minor groove transcription factor proteins. The effect on gene 
expression of polyamide binding to a segment of double stranded DNA is believed to be 
related to the function, e.g., promoter, of that segment of DNA. 

30 It is to he understood by one skilled in the art that the improved polyamides of the 

present invention may bind to any of the above-described DNA sequences or any other 
sequence having a desired effect upon expression of a gene. In addition, U.S. Patent No. 
5,578,444 in Tabic 1 lists numerous pathogens in which are found medically significant target 
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sequences for DNA-binding drugs and in Table II lists many non-infectious diseases that may 
be controlled at the level of DNA binding proteins. 

It is generally understood by those skilled in the art that the basic structure of DNA in a 
living cell includes both major and a minor groove. For the purposes of describing the present 
invention, the minor groove is the narrow groove of DNA as illustrated in common molecular 
biology references such as Levvin, B., Genes VI, Oxford University Press, New York (1997). 

To affect gene expression in a cell, which may include causing an increase or a 
decrease in gene expression, a quantity of one or more polyamides effective to inhibit 
transcription is contacted with the cell and internalized by the cell. The cell may be contacted 
by the polyamidc /// vivo or /// vitro. Effective transcription inhibiting extracellular 
concentrations of polyamides that can modulate gene expression range from about 10 
nanomolar to about 1 micromolar. Gottesfeld, J.M., et ai, Nature 387 202-205 (1997). To 
determine effective amounls and concentrations of polyamides in vitro, a suitable number of 
cells is plated on tissue culture plates and various quantities of one or more polyamide are 
added to separate wells. Gene expression following exposure to a polyamide can be monitored 
in the cells or in the medium by detecting the amount of the protein gene product present as 
determined by various techniques utilizing specific antibodies, including ELISA and western 
blot. Alternatively, gene expression following exposure to a polyamide can be monitored by 
detecting the amount of niessengcr RNA present as determined by various techniques, 
including northern blot and RT-PC]R. 

Similarly, to determine effective amounts and concentrations of polyamides for in vivo 
administration, a sample of body tissue or fluid, such as plasma, blood, urine, cerebrospinal 
fluid, saliva, or biopsy of skin, muscle, liver, brain or other appropriate tissue source is 
analyzed. Gene expression following exposure to a polyamide can be monitored by detecting 
the amount of the protein gene product present as determined by various techniques utilizing 
specific antibodies, including EI.JSA and western blot. Alternatively, gene expression 
following exposure to a polyamide can be monitored by the detecting the amount of messenger 
RNA present as detennined by various techniques, including northern blot and RT-PCR. 
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The polyai^iides of this invention may be formulated into diagnostic and therapeutic 
compositions for in vivo or in vitro use. Representative methods of formulation may be found 
in Remington: The Science and Practice of Pharmacy, 19th ed., Mack Publishing Co., Easton, 
PA (1995). 

5 

For in vivo use, the polyamides may be incorporated into a physiologically acceptable 
pharmaceutical composition that is administered to a patient in need of treatment or an animal 
for medical or research purposes. The polyamide composition comprises pharmaceutically 
acceptable carriers, excipients, adjuvants, stabilizers, and vehicles. The composition may be in 
10 solid, liquid, geh or aerosol fomi. The polyamide composition of the present invention may be 
administered in various dosage lonns orally, parentally, by inhalation spray, rectally, or 
topically. The term parenteral as used herein includes, subcutaneous, intravenous, 
intramuscular, intrastemai, infusion techniques or intraperitoneally. 

15 The selection of the precise concentration, composition, and delivery regimen is 

influenced by, inter alia, ihe specific pharmacological properties of the particular selected 
compound, the mtendcd use, the nature and severity of the condition being treated or 
diagnosed, the age, weight, gender, physical condition and mental acuity of the intended 
recipient as well as the route of administration. Such considerations are within the purview of 

20 the skilled artisan. Thus, the dosage regimen may vary widely, but can be determined routinely 
using standard methods. 

Polyamides of the present invention are also useful for detecting the presence of double 
stranded DNA of a specific sequence for diagnostic or preparative purposes. The sample 
25 containing the double stranded DNA can be contacted by polyamide linked to a solid substrate, 
thereby isohiting DNA comprising a desired sequence. Alternatively, polyamides linked to a 
suitable detectable marker, such as biotin, a hapten, a radioisotope or a dye molecule, can be 
contacted by a sample containing double stranded DNA. 

30 The design of bi functional sequence specific DNA binding molecules requires the 

integration of two separate entities: recognition and functional activity. Polyamides that 
specifically bind with siibnanomolar affinity to the minor groove of a predetermined sequence 
of double stranded DNA are linked to a functional molecule, providing the corresponding 
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bi functional conjugates useful in molecular biology, genomic sequencing, and human 
medicine. Polyamides of this invention can be conjugated to a variety of functional 
molecules, which can be independently chosen from but is not limited to arylboronic acids, 
biotins, polyhistidines comprised from about 2 to 8 amino acids, haptens to which an antibody 

5 binds, solid phase supports, oligodeoxynucleotides, N-ethylnitrosourea, fluorescein, 
bromoacetamide, iodoacetaniide, DL-a-lipoic acid, acridine, captothesin, pyrene, mitomycin, 
texas red, anthracene, anthrinilic acid, avidin, DAPI, isosulfan blue, malachite green, ethyl red, 
4-(psoraen-8-yloxy)-butyrate, tartaric acid, (-^)-a-tocopheral, psoralen, EDTA, methidium, 
acridine, Ni(II)*GIy-GIy-His, TO, Dansyl, pyrene, N-bromoacetamide, and gold particles. Such 

10 bifunctional polyamides are useful for DNA affinity capture, covalent DNA modification, 
oxidative DNA cleavage, and DNA photocleavage. Such bifunctional polyamides are useful 
for DNA detection by providing; a polyamide linked to a detectable label. Detailed 
instructions for synthesis of such bifunctional polyamides can be found in copending U.S. 
provisional application 60/043,444. the teachings of which are incorporated by reference. 

15 

DNA complexed to a labeled polyamide can then be determined using the appropriate 
detection system as is well known to one skilled in the art. For example, DNA associated with 
a polyamide linlced to biotin can be detected by a streptavidin / alkaline phosphatase system. 

20 The present invention also describes a diagnostic system, preferably in kit form, for 

assaying for the presence of the double stranded DNA sequence bound by the polyamide of 
this invention in a body sample, such brain tissue, cell suspensions or tissue sections, or body 
fluid samples such as CSF, blood, plasma or serum, where it is desirable to detect the presence, 
and preferably the amount, of the double stranded DNA sequence bound by the polyamide in 

25 the sample according to the diagnostic methods described herein. 

The diagnostic system includes, in an amount sufficient to perform at least one assay, a 
specific polyamide as a scparatel\ packaged reagent. Instructions for use of the packaged 
reagent(s) arc also typically included. As used herein, the term "package" refers to a 

30 solid matrix or material such as glass, plastic (e.g., polyethylene, polypropylene or 
polycarbonate), paper, foil and the like capable of holding within fixed limits a polyamide of 
the present invention. Thus, for example, a package can be a glass vial used to contain 
milligram quantities of a contemplated polyamide or it can be a microfiter plate well to which 
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microgram quantities of a contemplated polyamide have been operatively affixed, i.e., linked 
so as to be capable of being bound by the target DNA sequence. "Instructions for use" typically 
include a tangible expression describing the reagent concentration or at least one assay method 
parameter such as the relative amouats of reagent and sample to be admixed, maintenance time 

5 periods for reagent or sample admixtures, temperature, buffer conditions and the like. A 
diagnostic system of the present invention preferably also includes a detectable label and a 
detecting or indicating means capable of signaling the binding of the contemplated polyamide 
of the present invention to the target DNA sequence. As noted above, numerous detectable 
labels, such as bioiin, and deteciuig or indicating means, such as enzyme- linked (direct or 

10 indirect) strcptavidin, are well know n in the art. 



Example 1: 
Transcription inhibition in vitro and in vivo. 

15 

A high speed cytosolic extract from unfertilized Xenopus eggs was prepared as 
described in Hartl, et al., ./. Cell Biol 120, 613-624 (1993) . DNA templates for transcription 
were the somatic-lype 5S RNA gene contained in plasmid pXlsll (50 ng per reaction; 
Peterson et ciL, Cell 20, 131, 1980) and the tyrD tRNA gene contained in plasmid pTyrD (100 

20 ng per reaction: Stutz, et cii, denes Dev. 3, 1190, 1989) both from Xenopus laevis. 
Transcription reactions (20 \\L final volume) contained the following components: 2.5 |iL 
extract, 9 ng (12 nM) of TFIIIA isolated from immature oocytes (Smith et ai, Cell, 37, 645, 
1984), 0.6 mM ATP, UTP, CTP. 0.02 mM GTP and 10 ^iCi of [a- ^^P] GTP, and the final 
buffer components 12 mM HEPES (pH 7.5), 60 mM KCI, 6 mM MgCb, 25 \xM ZnCb, and 8% 

25 (v/v) glycerol. Plasmid DNAs were pre-incubated with polyamides in the same buffer prior to 
adding TFIIIA and other reaction components. RNA was purified and analyzed on a 
denaturing 6% polyacrylamide gel. A Molecular Dynamics Phosphorimager equipped with 
ImageQuant software was used \o quantify the effects of the polyamides on the relative 
transcription efficiencies of the 5S and TRNA genes. 

30 

Fibroblasts from a Xenopus kidney derived cell line (kindly provided by Dr. P. Labhart, 
The Scripps Research Institute) were grown at ambient temperature in 25 cm" culture flasks in 
Dulbecco's inoditlcd Eagle medium containing 10% (v/v) fetal calf serum. Cells were 
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passaged for a niinimum of three days prior to the addition of polyamide to the culture 
medium. Incubations were continued for various times and nuclei were prepared by hypotonic 
lysis and used cis templates for transcription as described. DNA content was determined by 
measuring the absorbance of an aliquot of the isolated nuclei in 1 % (w/v) sodium dodecyl 
5 sulfate (using an extinction coefficient at 260 nM of 1 AU = 50 )u,g/mL DNA). The buffer 
components and labeled and unlabeled nucleoside triphosphates were as for the plasmid 
transcription reactions. Reactions were supplemented with 2 |liL of RNA polymerase III (at 
approximately 50 |.ig/mL) isolated from Xenopus oocytes. 

10 Footprinting experiments reveal 90% inhibition of TFIIIA binding in the presence of 5 

nM polyamide. After incubation with 60 nM polyamide in a cell-free extract derived from 
Xenopus oocytes, the transcriptional activity of a 5S gene is inhibited by >80%. When 100 nM 
polyamide is supplied in culture medium containing Xenopus kidney cells, transcription 
complexes on the 5S RNA genes are selectively disrupted. These results demonstrate that 

15 pyrrole-imidazole polyamides are cell penneable and can inhibit the transcription of a specific 
gene in living cells. 

Polyamides were synthesized by solid phase methods as described in Baird, E. E. & 
Dervan, P. B. Am. Clicm. Sac. 118, 6141-6146 (1996). The identity and purity of the 
20 polyamides was verified by 'll NMR, matrix-assisted laser desorption/ionization time of flight 
mass speclrojnetry (MALDI-TOF-MS), and analytical HPLC. MALDI-TOF-MS: 1, 1223,4 
(1223.3 calculated for M+H); 2, 1222.3 (1222.3 calculated for M+H); 3, 1223.1 (1223.3 
calculated for M+H). 

25 

The eight-ring polyamide (labelled I in Figure lb) having the sequence composition 
lmPyPyPy-y-ImPyPyPy-[3-Dp was synthesized by solid phase methods and shown to bind the 
six base pair site 5'-AGTACT-3' at subnanomolar concentration (Figure 1). This sequence is 
within the binding site for /inc finger 4 of TFIIIA. Quantitative DNAse I footprint titration 
30 experiments reveal that polyamide 1 of Figure lb selectively binds the six base pair target 
sequence with a dissociation constant, Ka = 0,03 nM, a higher affinity than TFIIIA for its 50 
base pair site (K,i - 1 nM). Fm- controls, mismatch eight-ring polyamides ImPyPyPy-y- 
PyPyPyPy-P-Dp (labelled 2 in Figure lb) and ImPylmPy-Y-PyPyPyPy-P-Dp (labelled 3 in 
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Figure lb) were prepared which have 100- fold (Kj =2.0 nM) and 1000-fold (K<j =33 nM) lower 
affinities, respectively, for the 5'-AGTACT-3'site. The models of the binding of the 
polyamidcs to the itlentified target sequences of double stranded DNA are illustrated in Figure 
Ic. 

5 

The left side of Figure la shows a schematic model for the interaction of the nine 
zinc finger protein TFIIIA with the 5S ribosomal RNA gene internal control region (ICR). 
The middle section of Figure la shows the sequence of the ICR recognized by finger 4 in 
the minor groove. The six base pair region targeted by the designed "hairpin" polyamide is 
10 enclosed in a rectangle. The riglu section of Figure la shows the expected complex of 
ImPyPyPy-y-ImPyPyPy-p-Dp (polyamide 1) with its identified target DNA sequence, 5*- 
AGTACT-3'. Circles with dots represent lone electron pairs on N3 of purines and 02 of 
pyrimidines. Circles containing an H represent the N2 hydrogen of guanine. Putative 
hydrogen bonds are illustrated by dashed lines. 

15 

Figure lb shows the structures of polyamides ImPyPyPy-y-ImPyPyPy-P-Dp (1), 
ImPyPyPy-Y-PyPyPyPy-p-Dp (2). and hiiPylmPy-y-PyPyPyPy-p-Dp (3). The models of 
binding of these polyamides to tlie target duplex DNA sequence are shown in Figure Ic. 
The filled and empty circles represent imidazole and pyrrole rings, respectively, the curved 
20 line represents y-aminobutyric acid (y), and the diamond represents P-alanine. Hydrogen 
bond mismatches such as G-Py and A-Im are highlighted. 

The effect of polyamide labelled 1 in Figure lb (ImPyPyPy-y-ImPyPyPy-p-Dp) on 
TFIIIA binding to a restriction fragment isolated from a 5S RNA gene-containing plasmid was 

25 examined. Zri-3, a recombinant TFIIIA analog missing fingers 4-9, binds in the major groove 
of the C-block promoter element (sec Figure hi). DNase I footprinting demonstrates that zfi-3 
and polyamide 1 can co-occupy tlie same DNA molecule. When 5 nM polyamide 1 was 
preincubated with the same DNA target, the binding of nine finger TFIIIA was inhibited by 
>90% (Figure 2). Figure 2 shows (he result of DNase I footprinting analysis of the binding of 

30 polyamide I and TFIIIA to the 5S RNA gene ICR. 5'-end-labeled restriction fragments were 
derived from the 5S RNA gene by standard methods and footprinting reactions were as 
described ni Clemens, K.R., et ah. J. Mol. Dial. 244 23-35 (1994). Lane 1 shows a DMS G- 
specific sequencing ladder, lane 2. {Motcin-free DNA, lanes 3-5, digestion products obtained in 
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the presence of 0.5 nM, I iiJVl and 2 nM TFIIIA, respectively, lane 6, digestion products 
obtained in the presence of 5 nM polyamide 1. Lanes 7-9 show the digestion products of 
reactions ii^ which the DNA was incubated with 1 for 30 min prior to the addition of TFIIIA 
(at the same concentrations as in lanes 3-5, respectively) and incubated for an additional 30 
min prior to DNasc treatment. The differential inhibition of zfl-3 and full-length TFIIIA 
provides ev idence that finger 4 ititeracts with or is placed in the minor groove. Polyamide 1 
does not inhibit TFIIIA binding to 5S RNA. 

Transcription of the 5S RNA gene in an in vitro system was monitored in the presence 
of increasing concentrations (10-60 nM) of polyamide 1. The results are shown in Figure 3. In 
these experiments, polyamide 1 was added to a 5S RNA gene containing plasmid prior to the 
addition of exogenous TFIIIA (12 nM) and a crude extract derived from unfertilized Xenopus 
eggs. As a control, a tyrosine tKNA gene was included on a separate plasmid in these 
reactions. The tRNA gene has an upstream binding site for polyamide 1, but lacks a predicted 
protein-DNA interaction. Both genes are actively transcribed in this system, either individually 
(lanes 1 and 15) or in mixed template reactions (lane 2). Addition of 60 nM polyamide 1 
inhibits 5S gene transcription by >S0% (lane 6). Only a small degree of non-specific inhibition 
of tRNA transcription is observed at the concentrations of polyamide 1 required for efficient 5S 
RNA inhibition. The targeted 5S RNA gene is inhibited approximately lO-foId more 
effectively than the control tRNA gene. 

In tiiese exi:)eriments the DNA templates were incubated with polyamide for 30 min 
prior to the addition of TFIIIA, a cytoplasmic extract prepared from unfertilized Xenopus eggs, 
and labeled and unlabeled nucleoside triphosphates. Reactions contained either the somatic- 
type 5S RNA gene (Figure 3a, lane 1), a tRNA^^"^'"* gene (lane 15), or a mixture of both genes 
(lanes 2-14). Reactions contained the following final concentrations of the indicated 
polyamide: 0 nM, lanes 1, 2, 15; 10 nM, lanes 3, 7, 11; 20 nM, lanes 4, 8, 12; 40 nM, lanes 5, 
9, 13; 60 nM, lanes 6, 10, 14. Transcription reactions were stopped after 1 hour at ambient 
room temperature and the RNA products were analyzed on a denaturing polyacrylamide gel. 
The positions of 5S RNA atul the tRNA primary and processed transcripts are indicated at the 
right of the autoradiogram. In Figure 3b the results are represented graphically, with inhibition 
results expressed as the ratio of 5S RNA to tRNA transcription relative to the ratio obtained in 
the absence of polyamide. The error bars represent estimated standard deviations. 
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Mismatch polyaiiiides ImPyPyPy-y-PyPyPyPy-p-Dp (2) and ImPylmPy-y-PyPyPyPy- 
p-Dp (3) did not inhibit 5S RNA transcription at concentrations up to 60 uM. If the TFIIIA- 
DNA complex is first allowed to form, 30 nM polyamide 1 added, and the mixture incubated 
5 for 90 minutes prior to adding egg extract, efficient inhibition (80%) of 5S RNA transcription 
is also observed. Shoiler incubation times result in less inhibition. The required incubation 
time of 90 minutes is similar to the measured half-Hfe of the TFIIIA-DNA complex and 
supports that polyamide 1 forms a more stable complex with DNA than does TFIIIA. 

10 The effect of the polyamides on 5S gene transcription in vivo was determined (Figure 

4). Xenopus kidney-derived fibroblasts were grown in the presence of increasing 
concentrations of polyamide 1 in tlie culture medium for various times. Concentrations of 
polyamide up to 1 \yM were not toxic, as measured by cell density, if growth was limited to 
less than 72 hours. Nuclei were prepared from cells by hypotonic lysis and equivalent amounts 

15 of the isolated nuclei from control and treated cells were used as templates for transcription 
with exogenous RNA polymerase III and labeled and unlabeled nucleoside triphosphates. This 
experiment monitors the occupancy of class ITT genes with active transcription complexes. 5S 
RNA transcri|)tion can easily be assessed since the repetitive 5S genes give rise to a prominent 
band on a denaturing polyacr}damide gel. An autoradiogram was taken of the gel and the 

20 following observations made based on the observed autoradiogram. 

Nuclei were prepared from Xenopus kidney-derived fibroblasts that were grown in 
culture in the presence or absence of polyamides. Polyamides were included in culture 
medium (in 2.5 niL of media per 25 cnr t1ask) for 24 hours prior to harvesting cells and 

25 isolation of nuclei. Equal amounts of nuclei (containing 5 |Lig of DNA) were incubated for 2 
hours in 20 i^iL reactions containing Xoiopus RNA polymerase III and labeled and unlabeled 
nucleoside triphosphates. RNA was isolated from these reactions and analyzed on a denaturing 
polyacrylamide gel. The results are shown in Figure 4a in which lane 1 is a 5S RNA marker; 
lane 12 is a tRNA*-'^ gene marker; lane 2, cells not exposed to polyamide; lanes 3-11, cells 

30 exposed to the indicated polyamides 1, 2, or 3 at the following concentrations: 100 nM, lanes 3, 
6, 9; 300 niVl. lanes 4. 7, 10; I ^iVL lanes 5, 8, 1 1 . Figure 4b is a graphic representation of the 
effects of poiyamulcs 1, 2 and 3 on 5S RNA and tRNA transcription expressed as the ratio of 
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5S RNA to tRNA gene transcription relative to the ratio obtained in the absence of polyamide. 
Error bars represent estimated standard deviations. 

Concentrations of polyamide 1 as low as 100 nM have a pronounced and selective 
effect on 5S tianscription. At higher polyamide concentration, a general decrease in the 
transcriptional activity of the nuclei is observed. However, phosphorimage analysis of this 
experiment reveals that at each concentration tested, the effects of the polyamide are much 
greater on 5S RNA transcrption than on tRNA transcription. Having established that nearly 
maximal inhibition of 5S transcription is achieved with 1 |aM polyamide 1, nuclear 
transcription was monitored after various times of cell growth in the presence of the 
polyamide. No inliibition is observed for zero time incubation with polyamide 1 at 1 jiM 
concentration, indicating that disruption of transcription complexes does not occur during or 
after the isolation or work-up of cell nuclei. Statistically equivalent levels of 5S transcription 
were observed when the cells were exposed to polyamide 1 for 24, 48 or 72 hours. 

These nuclear transcription experiments indicate that polyamide 1 is able to enter cells, 
transit to the nuc lexis and disrupt transcription complexes on the cliromosomal 5S RNA genes. 
To rule out the possibility that the observed inhibition is due to some non-specific toxicity of 
the polyamide rather than to direct bindinii to the 5S RNA gene, the effects of mismatch 
polyamidcs 2 and 3 in the nuclear transcription assay were monitored. Only a small effect on 
5S RNA synthesis relative to tRNA synthesis is observed with 1 p.M of the mismatch 
polyamides 2 or 3 in the culture medium for 24 hours. This result indicates that the general 
inhibition of transcription observed with liigh concentrations of polyamide 1 may be a 
secondary effect of the inhibition of 5S RNA synthesis in vivo, rather than the result of non- 
specific polyamide interactions. Polyamide 2 affects a small enhancement of 5S RNA 
transcription in vitro and in vivo, indicating that polyamides may be able to upregulate 
transcription in certain cases. 
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Example 2: 

Design and Synthesis of Optimized Pol\ amides 
Targeted to HIV-1 Promoter Sequences 

5 The experiments of Example 1 showed that an eight-ring motif polyamide 

having the sequence composition hiiPyPyPy-y-ImPyPyPy-P-Dp binds specifically to the six 
base pair site 5'-AGTACT-3\ It has also been shown recently that the 10-ring, "5-y-5 motif 
polyamide ImPyPyPyPy-y-ImPyPyPyPy-P-Dp specifically binds a seven base pair 5'- 
WGWWWCW-3' target sequence with subnanomolar affinity in a hairpin conformation 

10 (Trauger, J.W., et aL, ./. Am. Chem. Soc,M% 6160-6166, 1996). Applying the polyamide 
pairing rules to the 5-y-5 molecular template suggested that the 10-ring hairpin polyamide 
ImPyPylmPy-y-lmPyPylmPy-p-Dp would bind the HIV-1 target sequences 5'-WGCWGCW- 
3'. However, as reported below, polyamide polyamide hiiPyPylmPy-y-ImPyPylmPy-P-Dp 
specifically binds the target site 5'-TGCTGCA-3\ but with relatively low affinity, 

15 necessitating development of a second-generation polyamide. 

Design of Second Generation Pol> amide 4. The simple amino acid P-alanine has 
been employed effectively as a single base pair-spanning linker in many different polyamides. 
In particular, in many cases in which a polyamide binds with relatively low affinity due to an 

20 apparent register mismatch between polyamide residues and their target DNA bases, 
substitution of one or more pynole residues with the fiexible spacer p-alanine results in marked 
increases in binding affinity. These results suggested that the 8-ring polyamide ImPy-p-ImPy- 
y-ImPy-P-ImPy-p-Dp (4), which differs from I only by the replacement of two pyrrole residues 
with p-alanines, would specifically bind the HlV-1 target sequences 5'-WGCWGCW-3' with 

25 high affinity using a novel 2-P-2-y-2-P-2 motif Equilibrium association constants were 
determined by quantitative DNase I footprint titration experiments of the formally matched 
(according to the polyamide pairing rules) polyamides 1 and 4 and the formally mismatched 
polyamides 2, 3. 5, and 6 for the HIV-1 tar^jct sequence 5'-TGCTGCA-3' (Figures 5b and 6). 
To further demonstrate the general usefulness of the 2-p-2-y-2-P-2 motif, the binding of 2 and 

30 5, and 3 and 6 to their respective fonnal match sequences 5'-TGGTGGA-3' and 5'- 

TGTTACA-3' was examined. iVlPE*Fc" footprinting and affinity cleavage studies were 
preformed for a subset of these compounds. Finally, the equilibrium association constants of 
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polyamide 4 for the HIV-1 target sequence 5'-TGCTGCA-3' were determined at either 24 
or 37 °C, and using either standard polyamide assay sohition conditions or approximate 
intracellular solution conditions. 

NMR spectra were recorded on a GE 300 instrun^ient operating at 300MH2 (^H). 
Spectra were recorded in DMSO-r/^ with chemical shifts reported in parts per million relative 
to residual DMSO-t/5. Polyamide concentrations were determined by UV absorbance 
measurements made on a Hewlett-Packard Model 8452A diode array spectrophotometer. 
Matrix-assisted, laser desorption/ionization time of flight mass spectrometry was carried out at 
the Protein and Peptide Microanalytical Facility at the California Institute of Technology. 
Analytical HPL.C was perfomicd on a HP 1090M analytical HPLC using a Rainen CI 8, 
Microsorb MV, 5 m, 300 x 4.6 mm reversed phase column in 0.1% (wt/v) TFA with 
acetonitrile as eluent and a flow rate of l.O ml/min, gradient elation 1.25% acetonitrile/min. 
Preparative HPLC was carried oitt on a Beckman instrument using a Waters DeltaPak 25 x 100 
mm, 100 m Ci8 column in 0.1% (wt/v) TFA, gradient elution 0.25%/min. CH3CN. E. coli 
XL-1 Blue competent cells w^ere obtained from Stratagene. Restriction endonucleases were 
purchased from Boeringher-Mannhemi or New England Biolabs. Plasmid isolation kits were 
obtained from Promega. Plasmid sequencing was carried at the Sequence Analysis Facility at 
the CaHfomia Institute of Technology. Scquenase (version 2.0) was obtained from United 

32 

States Biochemical, and DNasc I (FPLCpure) w^as obtained from Pharmacia, [a- P]- 

Thymidine-5'-t] iphosphate (>3000 Ci/mmol), [a- "P]-deoxyadenosine-5'-triphosphate (> 6000 

32 

Ci/mmol), and [a- P]-adenosine-5'-triphosphate were purchased from Du Pont/NEN. Water 
was obtained from a Millipore Milli-Q water purification system. 

Synthesis. Polyamides ucrc prepared by stepwise solid-phase synthesis using BOC- 
protected monomers as described by Baird. E.E.; Dcrvan, P.B. J. Am. Chem, Soc. 1996, 7/5, 
6141-6146. 

Plasmids. Plasmid pJT2B2 was prepared by hybridizing the complementary 
oligonucleotides, 

5'-CCGGCTTA AG TTCGTGGGCC ATGCTGCATT CGTGGGCCAT GGTGGATTCG 
TGGGCCATGT TACATTCG-3' and 
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5'-TCGACGAATG TAACATGGCC CACGAATCCA CCATGGCCCA CGAATGCAGC 
ATGGCCCACG AACTTAAG-3'. and ligaling the resulting duplexes to the large pUC19 Ava 
l/Sal 1 restriction fragment. 

5 The plasniid was transformed into E.coli. and plasmid DNA isolated using standard 

methods, and the sequence of the insert conHmied by sequencing. The 3'- P end-labeled Eco 

RI/Ay/ II restriction fragment was prepared by simultaneous digestion with Eco RI and Pvu II 

and 3 '-fill-in using Sequenase, [a- ^Tj-deoxyadenosine-S'-triphosphate, and [a- P]- 

thymidine-5'-triphosphate. The 282 bp fragment was isolated by nondenaturing gel 

32 

10 electrophoresis. The 5'- P-end-Iabeled EcoRl/Fvuli fragment was prepared using standard 
methods. See Sambrook, J.; Fritsch, E.F.; Maniatis, T. Molecular Cloning; Cold Spring 
Harbor Laboratory: Cold Spring Harbor, NY, 1989. 

A-specific chemical sequencing was carried out as described by Iverson, B.L,; Dervan, 
P.B. Nucleic Acids Res, 1987, 75, 7823-7830. Standard methods as described by Sambrook, et 

15 al. (1989) were used for all DNA manipulations. 

Quantitative DNase I Footprinting. Equilibrium association constants for polyamide- 
DNA complexes were determined by quantitative DNase 1 footprint titration experiments. 
Reactions were carried out in a total volume of 400 |j.L in the absence of carrier DNA. A 

20 polyamide stock solution or H2O (for reference lanes) was added to a solution containing 3'- 
end-labeled restriction fragment (15,000 cpm) affording final solution conditions of either 10 
mM Tris'HCl, 10 mM KCl, 10 mM Mg( h, 5 mM CaCl2, pH 7.0. at 24 °C, or 10 mM 
HEPES-HCl, 140 mM KCl, 10 mM NaCK I mM MgCl2, 1 mM spermine, pH 7.2 as indicated 
in Tables 1 and 2, The solutions were allowed to equilibrate at 24 °C or 37 ""C as indicated in 

25 Tables 1 and 2 for 12-16 hr. Cleavage reactions were initiated by the addition of 10 |LiL of a 
DNase I stock solution (at the appropriate concentration to give --55% intact DNA) containing 
1 mM dilhiothreitol, allowed to proceed for 7 min. at 24 °C, or 3.5 min, at 37 °C, and stopped 
by the addition of SO of a solution containing 2.25 M NaCl, 150 mM EDTA, 0.6 mg/mL 
glycogen, and 30 |.iM base-pair calf thynuis DNA, and precipitated with 1 mL ethanol. 

30 Reactions were then resuspcndcd in Ix TBE/SO% fonnamidc, heated at 85 °C for 10 min. 
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placed on ice, and the reaction products separated by electrophoresis on an 8% polyacrylamide 
gel (5% cross-link, 7 M urea) in Ix TBE at 2000 V. Gels were dried, exposed to a storage 
phosphor screen (Molecular Dynamics), and imaged using a Molecular Dynamics 400S 
Phosphorlmager. Data was obtained from the imaged gels by quantitation using ImageQuant 
5 software (Molecular Dynamics). Background-corrected volume integration of rectangles 
encompassing the footprint sites and a reference site at which DNase I reactivity was invariant 
across the titration generated values for the site intensities {^^{iq) and the reference intensity (L 

j.g|). The apparent fractional occupancy (Oapp) of the sites was then calculated using the 

equation: 

I silo / Ircf 

10 Oapp = 1 - ' , (1) 

I siU- / 1 r^f 

where i^gite ^^^^ '^ref ^^'^ ^^^^ ^'^'^ ^^^^ reference intensities, respectively, from a control lane to 
which no polyarnide was added. The ([L]|.q^, Qapp) data points were fit to a general Hill 
equation (eq, 2) by minimizing the difference between 9app and Ofit: 

9fu - Gmin + (Qmax - Qnnn ) ^— ^ (2) 

1 + Ka"[L]"tot 

15 where [^\q{ is the total polyarnide concentration, Ka is the equilibrium association constant, 
and Oniin and Oiaax are the experimentally detennined site saturation values when the site is 
unoccupied or saturated, respectively. The data were fit using a nonlinear least-squares fitting 
procedure with Ka, 9max, and Gmin as the adjustable parameters, and with a fixed value for n. 
For hairpin polyamides 1-6, binding isotherms were adequately fit by Langmuir isotherms (eq. 

20 2, n=l), consistent with formation of a 1:1 polyamide-DNA complexes. For unlinked 
polyamides 7-9 binding to their match sequences, binding isotherms were adequately fit by a 
cooperative isotherm (eq. 2, n=2) consistent with cooperative dimeric binding. Reported 
association constants are the average value obtained from at least three independent 
footprinting experiments. 

25 MPE'Fe^^ Footprinting and Affinity Cleavage Experiments. Exact binding sites 

were determined by MPE*Fe^^ footprinting. Binding orientation was probed using affinity 
cleavage experiments. All reactions were carried out in a total volume of 400 \xL, A stock 

solution of polyarnide (for MPE*Fc^' footprinting), polyamide-EDTA (for affinity cleavage), 
or H2O ffor reference lanes) was added to a solution containing 3'- or 5 '-end-labeled 
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restriclion fragment (15,000 cpm) affording final solution conditions of 20 mM HEPES, 200 
mM NaCl, 50 [Lig/mL glycogen, and pH 7.3. and the solution allowed to equilibrate for 12 hr at 

24 ^C. Next, for MPE*Fe^^ footprint ing reactions, MPE*Fe^^ was added to a final 
concentration of 0.5 ]iM and the solution allowed to equilibrate for 10 min. (a 5 fiM 

5 MPErFE^^ stock solution was prepared by mixing equal volumes of 10 |aM MPE and freshly 
prepared 10 |aM Fe(NH4)2(S04)2), and for affinity cleavage reactions, freshly prepared 
Fe(NH4)2(S04)2 vvas added to a final concentration of 1 |-iM and the solution allowed to 

equilibrate for 30 min. For both MPE»Fe'^ footprinting and affinity cleavage, cleavage was 
initiated by the addition of dithiothreitol to a final concentration of 5 mM and allowed to 

10 proceed for 30 min at 24 ""C, then stopped by adding 1 mL ethanol. Next, 10 |aL of a solution 
containing calf thymus DNA (140 f,iM base-pair) (Phamiacia) and glycogen (Boehringer- 
Mamiheim) (2.S mg/mL) was added, and the DNA precipitated. The reactions were 
resuspended in IX TBE/80% fomiamide, heated at 85 "^C for 10 min, placed on ice, and the 
reaction products separated by electrophoresis on an 8% polyacrylamide gel (5% cross-link, 7 

15 M urea) in IX TBE at 2000 V. Gels were dried and imaged as described above. Relative 
cleavage intensities were determined by volume integration of individual cleavage bands using 
ImageQuant software (Molecular Dynamics), 

Results and Discussion 

20 

Synthesis, 

Polyamides were synthesized by solid phase methods as described in Baird, E.E.; 
Dervan, P.B. J. Am, Chem. Soc, 118, 6141-6146 (1996). 

25 Bindiiij^ Affinities. 

Quantitative DNase I footprint titration experiments (10 mM Tris^HCl, 10 mM KCl, 10 

mM MgCb, 5 mM CaCl2, pH 7.0, 24 °C) were carried out on the 282 bp, 3'-^"^? end-labeled 
pJT2B2 EcoRl/Pvull restriction fragment which contains the three target sites 5'-TGCTGCA- 
3^ (the MIV-1 target sequence), 5'-TGGTGGA-3\ and 5^TGTTACA-3' (Figure 7). These 
30 experiments reveal that the 5-Y-5 motif polyamide I binds the HIV-1 target sequence 5'- 

TGCTGCA-3' ^^'ith a relatively low equilibrium association constant, Ka = 8 x 10^ M'^, while 
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no binding of the 5-y-5 polyamidc 2 to its target site 5'-TGGTGGA-3' is observed (Ka < 5 x 
10^ M"^). In contrast, 5-y-5 polyamide 3 binds its target site 5'-TGTTACA-3' with very high 
affinity (ICa - 5 x 10^° M"^). (Figure 8, Table 1). 

Quantitative DNase T footprinting experiments reveal that the three 2-P-2-Y-2-P-2 motif 
polyamides 4, 5 and 6 selectively target their respective match sequences 5'-TGCTGCA-3' 
(the HlV-1 target sequence), 5'-TGGTGGA-3\ and 5'-TGTTACA-3' with equiUbrium 

9 -1 

association constants greater than 1 xlO M (Figure 8, Table 1), demonstrating that the 2-p- 
2-Y-2-p-2 motif allows specific targeting of a range of 7 base pair sequences with 
subnanomolar affinity, and is generally useful for all 5'-WNNWNNW-3' sequences. 
Polyamide 4 specifically binds the HIV-1 target sequence 5'-TGCTGCA-3' with 

subnanomolar affinity (Ka = 2 x 10^^ M and thus represents a solution to the present 
polyamide design problem. 

Table 1. Equi)ibrtum association consianis fM 



Polyamidc 5-LiTGCTGCAt-3' 5'-aTGGTCGAi'3' 5'-aTGTTACAi-3* 



S-y-Smodf: 








I IrnPyPyImPy-Y-(mPyPy(mPy-p*Dp 


3.3 X lO'd.S) 


< ! X 10^ 


< I X 10^ 


2 ImlmPylmlm-7-PyPyPyPyPy-P-Dp 


< 5 X 10^ 


< 5 X U)^ 


< 5 X 10^ 


3 IniPyPyPyPy-Y-lmPyPyPyPy-f3'Dp 


\2 A 10''(0.4) 


< 5 X 10^ 


5.1 X id'** (1.1) 


2 /3-2-7-2-/^2 moiif: 








4 lmPy-(i-lmPy-7-lmPy-f54mPy-[5-Dp' 


1.5 X 10^^ (0.3) 


< 5 X IQ-^ 


< 5 X 10^ 


5 Imlm-p-Imltn-Y-PyPy-P-PyPy-P-Dp 


1.7 IQ-^ (0.3) 


7.6 X lo'' (1,0) 


< 1 X 10^ 


6 •fnPy-P-PyPy-y'[mPy-(3-PyPy-P-Dp 


< 1 -K 10^ 


< ) X 10^ 


2.6 X 10^ (1.0) 


Unlinked dirr\er motif: 








1 lmPyPyPyPy-[i-Dp 


\A \ 10' (0.2) 


< ! X 10^ 


3-9 X l0^(0•6) 


S imPy-(i-ImPy-[i.Dp 


3.0 X 10*^ (0.2) 


< 2 X 10^ 


1.5 X IONO.3) 


9 [(nPy-(3-PyPy-p-Dp 


< 5 X 10^ 


< 5 X 10^ 


2.3 X 10*^ (O.l) 



'Values reported arc ihc mean values from at Icasi ihrcc DN:ise I tootprini iitrniion experiments. The ^iiondard 
deviation for t:ach value is indicated in parentheiics. Tlie i:;says were corned out at 2-t "C. pH 7.0. in the presence of 10 
iTiM Tris-HCl. 10 mM KCi. 10 mM MgCK. ajid 5 tnM CaCi,. Nucleotides flanking polyamide binding sites arc in 
lowercase typo. Association constants corresponding to formally matched complexes arc in bold type. 



This polyamidc bmds the site 5'-(gttacatTCGTCCA--3" ;]djaccni to the 5'-TGTrACA-3* site with an equilibrium 
a;;.:ociauon constant of 2. 1 x 10'' M ' (0,1). 



Failure of Polyamide L 



The failure of polyamide 1 to bind the HlV-l target sequence with high affinity is 
consistent with previous results. For example, the 4-y-4 motif polyamides ImPyPyPy-y- 
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ImPyPyPy-P-Dp, IinPyPyPy-y-PyPyPyPy-p-Dp and ImlinPyPy-y-lmlmPyPy-P-Dp bind 

respective 6 base pair target sequences with subnanomolar affinity (Ka > 10^^ M'^), while 
ImPylmPy-y-IniPylnnPy-p-Dp and hiilmlmhii-y-PyPyPyPy-P-Dp bind respective target 

8 1 

sequences with substantially lower affinity (Ka < 10 M ). Based on these and additional 
5 results, the common feature of hairpin polyamides that bind with high affinity appears to be the 
placement of imidazole residues only at the first and second positions of polyamide subunits, 
counting from the N-terminal end, although in some cases substitution at the third position is 
tolerated. Imidazoles placed beyond the second position of a polyamide subunit are not 
positioned optimally for specific ligand-DNA contacts. 

10 

The 5-Y-5 polyamides 1, 2, and 3 are analogs of 4, 5, and 6, respectively, in which the 
P-alanine residues have been changed to pyrrole residues (Figure 6). Polyamides 1 and 2, 
which have imidazole residues beyond the second position of a polyamide subunit, have 
equilibrium association constants for their match sites more than 100-fold lower than their P- 
15 alanine-substituted analogs 4 and 5, respectively (Table 1). Thus, the 2-P-2-y-2-P-2 motif is 
essential for high-affinity recognition in these cases. In contrast, polyamide 3 having no 
imidazoles beyond the second position of a polyamide subunit binds its match site with an 
equilibrium association constant ~20-fold higher than its p-alanine-substituted analog 6 (Table 
!)• 

20 

The following model is consistent with our data: 1) In the 2-p-2-y-2-p-2 polyamides 4 
and 5 the flexible P-alanine linkers coiTectly position the imidazole residues following them for 
specific hydrogen bond contacts with their target guanine bases, while in 5-y-5 polyamides 4 
and 5 the analogous hydrogen bonds cannot form. 2) In contrast, for both 2-P-2-y-2-P-2 
25 polyamide 6 and polyamide 5-y-5 3, the pyrroles beyond the second positions within 
polyamide subunits are correctly positioned, and 3 binds with lower affinity than 6 due to its 
greater conformational entropy. 

These results indicate that the optimal binding positions, or "binding register," is 
30 different for imidazole and pyrrole residues, consistent willi previous results which suggest 
that, for subunits composed entirely of rings (i.e. without p-alanine linkers), imidazole residues 
go out of register after 2-3 residues with substantial drops in affinity, while pyrrole residues go 
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out of register after 5 rings with a gradual leveling of affinity before a substantial drop at 8 
rings. Finally, these results indicate that the 2-p-2-y-2-P-2 and S-y-S motifs are complementary 
for recognition of 7 base pair sequences, with the optimal motif dependent on the sequence 
composition of the desired target site. 

5 

Relative Effects of Covalcnt Linkage with y-Aniinobutyric Acid. 

Equilibrium association constants for the ''unlinked dimer motif polyamides 7, 8, and 
9 were measured to allow comparison with their corresponding y-aminobutyric acid-linked 
polyamides 3, 4, and 6, Polyamides 4 and 6, which are composed of 2-y-2 subunits, bind with 
10 -6000-fold and -1000-fold higher affinity, respectively, relative to their respective unlinked 
analogs 8 and 9. Polyamide 3, which is composed five-ring subunits, binds with -100-fold 
higher affinity than its unlinked analog 7. The significantly larger increase in affinity upon 
linkage with y-aminobutyric acid for 8 and 9 relative to 7 is consistent with the greater 
structural rigidity and consequently greater prcorganization of 7. 

15 

Exact Binding Sites and Binding Orientations. 

MPE»Fe'' footprinting experiments carried out with the four polyamides 4, 5, 6, and 3, 
which have subnanomolar binding affinities, confimi that tlicse compounds specifically bind 
their respective seven base pair match sites (Figures 9a and 10a). Affinity cleavage 

20 experiments were performed with the EDTA-polyamides 4-E, 5-E, 6-E, and 3-E to identify the 
location of the C~termini of these polyamides when bound to their target sites (Figures 6, 9b, 
I Ob). Tiie symmetric polyamides 4-E, 6-E, and 3'E produce cleavage patterns at both sides of 
their binding sites as expected, consistent with two disUnct binding orientations (Figure 9c). It 
appears tliat, in general, side-by-side polyamide-DNA comple^xes preferentially bind in the 5' 

25 to 3' (K- to C-terminus) orientation. Recent results of studies with hairpin polyamides are 
consistent with fonnation of 3' to 5'-oriented polyamide-DNA complexes, and suggest that 
such "reversed orientation" complexes typically have affinities -10-fold lower than analogous 
5' to 3' oriented complexes. Consistent with this trend, polyamide 4 binds the "reversed 
orientation" match site S'-TCGTCGAo' with an affinity -10-fold lower than it binds the site 

30 5'-TGCTGCA-3' (see footnote to lable 1). Inconsistent v/ith this trend, however, the 
assymmctric polyamide 5-E produces a cleavage pattern (Figure 10b) consistent with the 
polyamide binding preferentially in the reversed orientation (3' to 5\ N- to C-terminus) as 
illustrated (for the parent compound 5) in Figure 10c. 
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Effect of Physiological Salt and Teiiipcrntiire. 

Quantitative DNase I footprinting experiments indicate that increasing the equilibration 
tempcraiurc from 24 ""C to 37 ""C, and changing the solution conditions from standard 
5 polyaniide assay conditions (10 niM Tris-IJCl, 10 mM KCI, 10 miM MgCl2, 5 mM CaCl2, pH 
7.0 at 24 °C) to conditions modeliiig those encountered within a typical mammalian cell (140 
mM KCL 10 mM NaCl, 1 mM MgC!2, 1 nrVl spermine, pH 7.2) has little effect on polyamide 
binding affinity (Table 2). 

Table 2. Equihbriam association constants (M ") of 
polyamide 4 for its matc:h site 5'-TGCTGCA-3' Z 



Buffer^ Temp. 5^aTGCTGCAiL-3' 



A 


24 "C 


1.5 X 10'°(0.3) 




37 °C 


8.4 K 10'(2.31) 


B 


24 °C 


1.9 X 10'°(0.6) 




31 °C 


1.1 X I0'°(0.2) 



'Values rcponed are the mean values from at least three 
DNasc I footprmt ciiraition experimenis. The standard 
deviauon for each value is indicated in parentheses. 



'Buffer A\:.lO mM Tns^HCl. 10 mM KCI. 10 mM 
MgCU, and 5 mM CaCK. pH 7.0 ai 24 °C; Buffer B: 10 
mM HEPES-HCl. 140 mM KG), 10 mM NaCl, 1 mM 
MgCK. 1 mM spermine. pH 7.2. 

10 

Example 3: 

Binding of Poiyamidcs to HIV-l Promoter Sequences 

15 Sequence-specific DNA-biiiding small molecules that can permeate human cells could 

potentially regulate transcription of specific genes. Pyrrole-imidazole polyamides were 
designed to bind DNA sequences proximal to binding sites for the cellular transcription factors 
TBP and LEF-l utiHzed by HIV-l for RNA synthesis. These synthetic ligands inhibit HIV-l 
transcription in cell-free assays and virus replication in isolated human peripheral blood 

20 lymphocytes. The ability of small molecules to target predetermined DNA sequences located 
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within RNA polymerase II promoters siiiigests a general approach for regulation of gene 
expression, as well as a mechanism for the inhibition of viral replication. 

In Example 1, above, it was demonstrated that an eight-ring hairpin polyamide targeted 
to a specific region of the transcription factor TFIITA binding site inhibits 5S RNA gene 
transcription by RNA polymerase ill in Xeuopus kidney cells. See also Gottesfeld, J.M,, et al., 
Nature 387, 202 (1997). Using a similar approach, it has been found that polyamides can 
inhibit HIV- 1 transcription in cell-free assays and viral rephcation in human lymphocytes. 

The HlV-1 enhancer / promoter element contains binding sites for the cellular 
transcription I'actors Ets-1, LEF-1, NF-icB and SPl along with a canonical TBP binding site 
(TATA element) and an initiator sequence (Figure 1 1, taken from K. A. Jones & B. M. Peterlin 
Anmi. Rev. Biochem. 63, 717 (1994)). Jones & Peterlin (1994) describe the composition of the 
HIV-1 enhancer region sequence. Binding sites for such transcription factors are not optimal 
polyamide target sequences because tliey are found in the promoters of many protein-coding 
genes. However, the sequences immediately flanking these transcription factor binding sites 
vary from gene to gene. In some cases, transcription factor flanking sequences are conserved 
for a particular gene providing an address for gene-specific targeting of common transcription 
factors. For example, the sequences inmiediately upstream and downstream from the HIV-1 
TATA element are of the foiin 5*-VVGCWGCW-3' (where W = A or T). At least one copy of 
the 5'-WGCWGCW-3' sequence is located adjacent to the HIV-1 TATA element in all 
sequences of the FIIV-1 LTR found in the ElVLBL and Genbank data bases K. Freeh, R. Brack- 
Werner, T. Werner describe the common modular structure of viral LTR's. Virology 224, 256 
(1996) However, the 5'-TGCTGCATATA.AGCAGCT-3* TATA element is found only in 
strains and isolates of HTV-1 and in jio other sequence in either data base. The activity of the 
HIV-1 promoter, and especially Tat-induced activity, is critically dependent upon this 
sequence. For example, point mutations in the TATA flanking region reduces activated 
transcription by 10-fold without effecting basal transcription. Tat-transactivation is essential 
for HIV- 1 RNA synthesis and virus replication. B. Berkhout, K.-T. Jeang describe TAT 
induced expression of the HlV-1 long terminal repeat J. Virol 66, 139 (1992); H. S. Olsen, C. 
A. Rosen, describe contribution of the TATA n^otif to the TAT-mediated transcriptional 
activation of HlV-l gene expression. ./ ViroL, 66, 5594 (1992). M. R. Sadaie, T. Benter, F. 
Wong-Staal. describe site directed mutagenesis of the 2-trans regulatory genes of HIV-1. 
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Science 239, 910 (1988). Although the possibility of mutational change cannot be precluded, it 
would expected that targeting the HIV-1 TATA flanking sequence with a polyamide might 
prove effective for inJiibition of HIV-1 transcription and virus replication. A 5'-WGCWGCW- 
3' sequence which is not conserved in all strains of HIV-1 is located adjacent to the Ets-1 
5 binding site. An additional search identified a 5'-WGWWCVV-3' sequence adjacent to the 
binding site for LEF-1 which was conserved for most reported strains of HIV-1. In order to 
regulate pol II transcription of only HIV-l, polyamides were designed to bind immediately 
adjacent to rather than directly to the binding sites for TBP and LEF-1 (Figure 1 1). 

10 DNase footprint titrations showed that polyamide 1 bound to the site adjacent to the 

LEF-1 binding site with an apparent dissociation constant of 0.1 nM (Figure 12, site Pl-1). The 
labeled DNA was incubated with the following concentrations of polyamide 1 for 45 min prior 
to the addition of LEF-1 DBD to a final concentration of 8 nJNd (in the reactions of lanes 7 to 
14): no polyamide (lanes I and 7); 10 pM (lanes 2 and 8); 30 pM (lane 9); 0. I nM (lanes 3 and 

15 10); 0.3 nM (lane 1 1); 1 nM (lanes 4 and 12); 3 nM (lanes 5 and 13); 10 nM (lanes 6 and 14). 
After a second incubation period of 45 min, the samples were subjected to DNase I digestion 
and analyzed by gel electrophoresis. The location of polyamide sites Pl-1 to PI -4, LEF-1 sites 
L1-L3, the LEF-1 DNase hypersensitive (HSS) site and the start-site for transcription (+1) are 
shown along the side the autoradiograni. Three additional potential binding sites for polyamide 

20 1 are present in the HIV-1 promoter (5'-TGTACT-3') and coding sequence (5'-AGATCT-3') 
and polyamide 1 also binds these sequences (Pl-2 to Pl-4, Figure 12). In agreement with 
previous published studies, recombinant LEF-1 protein (recombinant DNA binding domain) 
bound to its target site with an apparent dissociation constant of 1.5 nM (as determined by 
DNase footprint titrations). Two additional LEF-1 sites are found in the HIV-1 promoter and 

25 coding sequences (sites L2 and L3, Figure 12). Importantly, polyamide 1 proved to be an 
effective inhibitor of LEF-1 binding to each of these sites with a Ki of 0.1 nM for the enhancer 
site. Importantly, inhibition was observed either when the polyamide was incubated with the 
DNA before adding LEF-1 or after preincubation of the DNA with LEF-l. In similar DNase 
footprinting experiments, the mismatcli polyamides did not inhibit LEF-1 binding even at a 

30 100- fold higher concentration that that needed for inhibition by polyamide 1. 
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Based on the results described above, it was expected that polyamide inhibition of LEF- 
1 binding would inhibit LtF-l -dependent transcription in an appropriate in vitro assay system. 
The effects of polyamide 1 and mismatch polyamide 2 on HIV-1 transcription were tested in an 
in vitro system consisting of a cell-fi ee extract prepared from cultured human lymphoid H9 

5 cells. This whole-cell extract contains high levels of LEF-1 protein but supports only low 
levels of transcription, suggesting a limitation for other transcription components in this 
extract. The H9 cell extract was supplemented with the HeLa cell-derived nuclear extract in 
order to obtain high levels of transcription. The H9 cell extract stimulates HIV-1 transcription 
^2.5 to 3-fold over the level of transcription obseived with the HeLa extract alone and 

10 immunodepletion of LEF-I protein from the H9 extract abolishes this activated transcription. 
Polyamide 1 is an effective inhibitor of HlV-l transcription in this system: 50% inhibition of 
transcription is obtained in polyamide titration experitnents between 10 and 30 nM polyamide 
1 in the reaction. Polyamide 1 fails to inhibit HIV-1 transcription in the LEF-1 -depleted 
extract or with the HeLa extract alone. Similarly, LEF-1 depletion had no effect on CMV 

15 MIEP transcription. The effects of the mismatch polyamide 2 on HIV-1 and CMV 
transcription were tested as additional controls. No potential binding sites for either polyamide 

I or 2 are present in the CMV MIEP sequence. As expected, polyamide 1 fails to inhibit CMV 
transcription and mismatch polyamide 2 fails to inhibit either HIV-1 or CMV transcription. 

20 Notably, HIV-1 transcription was not inhibited by polyamide 1 at concentrations as 

high as 1 uM even though binding sites for the polyamide occurred near the start-site for 
transcription (-2 to -7) and within the transcribed sequence (1-20 to +25). A polyamide was 
designed and synthesized ui inhibition studies with the human cytomegalovirus (CMV) 
immediate early promoter. A potential binding site for polyamide 3 (Figure 1 above) is located 

25 near the start-site for CMV transcription. Similar to our resulls with polyamide 1 and HIV-1 
transcription, polyamide 3 had no effect on CMV transcription }n our reconstituted assay 
system. These results with both the I il V-1 and CMV promoters suggest that RNA polymerase 

II can initiate transcription and read through DNA bound with a polyamide in the minor 
groove. Thus, a polyamide must intcrlere with an essential DNA-binding protein to be an 

30 inhibitor o f transcription. 
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Another well-studied minor groove DNA-binding protein is the TATA-box binding 
protein [ TBP]. X-ray crystallographic studies of the TBP-DNA complex have clearly shown 
that TBP binds DNA in the minor groove and that binding results in a severe distortion of the 
DNA duplex (unwinding and bending), 

5 

Since TATA elements are found in many protein-coding genes (with the exception of 
the so-cLiUed "liousekeeping" genes), the TATA element itself is not a good candidate for a 
polyamide target sequence. However, the sequences immediately flanking TATA boxes are 
unique lo each individual gene with little or no homology between genes. Thus, by analogy 
10 with our LEF-1 results, it would be expected that targeting the sequence immediately adjacent 
to a TA TA element with a polyamide might inliibit TBP binding and thus inhibit transcription 
of thai gene by interfering with assembly of the IINA polymerase IT transcription complex. 
Note that it is tlie binding of the TBP subunit of TFIID that nucleates the assembly of the RNA 
polymerase Tl transcription machinery for TATA-containing genes. 

15 

As a model system the polyamide 1 binding site was introduced immediately adjacent 
to the HlV-1 TATA element by oligonucleotide-directed mutagenesis. Thus the TATA region 
sequence was changed from 5'-CATATAAGCAGCT-3' (WILD-TYPE) to 5*- 
CATATA AGTACT T-3' (iVlUTANT, polyamide 1 site underhned). 

20 

1 he effect of the polyamide on the formation and stability of the TBP-DNA complex 
was determined using a gel mobility shift assay. As observed in previous gel mobility shift 
assays with recombinant TBP, both monomer and dimer TBP-DNA complexes are observed 
with both the wild-type and mutant TATA-box DNA fragments. Polyamide 1 inhibited TBP 

25 binding to the "mutant" HIV-1 promoter DNA probe but did not significantly inhibit TBP 
binding to the \\ ild-typc HlV-1 promoter probe (Figure 13 A). Similarly, polyamide 1 inhibited 
basal transcription from die "mutant" f [IV-1 promoter but not from the wild-type promoter in a 
reconstituted transcription system containing a HeLa cell nuclear extract (Figure 13B, below). 
Approximately 70% inhibition of basal RNA polymerase Il-mediated transcription was 

30 observed in this assay with polyamide 1 at a concentration of 100 nlv4. Additionally, neither of 
the misnratch polyamides inhibited TBP binding or transcription. 
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Based on these results, another polyamide was designed and synthesized to specifically 
target the HIV-1 TATA element as described in Example 2 (Figure 14, below): As shown 
below, this poh amide (ImPy-(3-hnPy-Y-ImPy-P-ImPy-p-Dp, called HIV-1 in Figs. 14-116 and 
polyamide 1 in Figs. 18-24) was designed to recognize the sequences immediately 5' and 3' to 
5 the HlV-1 TA TA element. The recognition sites for polyamide HIV-l are of the form 5'- 
WGCWGCW-3" (where W = A or T). Since a pyrrole-pyrrole pair can recognize either an A-T 
base pan* or a T -A base pair, polyamide HIV-1 will recognize both the 5' and 3* flanking 
sequences of the HIV-1 TATA elemejit. Binding of the polyamide was confirmed by DNase I 
footprinting and an apparent dissociation constant of 0.05 nM (Ka = 2 X 10^^ M"^) was 

10 determined for both sites (Figure 14). This polyamide inhibits HIV-1 transcripdon in an in 
vitro transcription assay with the lymphoid cell nuclear extract and a wild-type HIV-1 
promoter but not transcription of a control template lacking the binding site (the CMV 
immediate early promoter). A quantitative representation o( the data is shown in Figure 15. 
Fifty percent inhibition of HIV-1 transcription is observed at approximately 50 nM polyamide 

15 in the reaction. This concentration corresponds to only a 16-fold excess of polyamide over 
specific binding sites in the HIV-1 plasmid DNA (3 nM). 

A corresponding control polyamide was also synthesized (Imlm-P-Imlm-Y-PyPy-p- 
PyPy-(^Dp, named HIV-l in Figure 16 and polyamide 2 in Figures 18-24). This polyamide 

20 differs from the HIV-l polyamide only in the placement of tlie imidazole and pyrrole amino 
acids aixi recognizes the TATA box region of the HIV-1 LTR with at least 100-fold reduced 
affinity relative to the HIV-l polyamide (Kq = 2 X 10 ^ M"*). In Figure 16, the structures of 
polyamidcs HIV-1, ImPy-p-ImPy-y-lmPy-P-ImPy-p-Dp and IIIV-2, Imlm-p-Imlm-y-PyPy-P- 
PyPy-P-Dp, arc shown along with polyamide binding models for the HIV-1 TATA box region, 

25 The filled and unfilled circles represent imidazole and pynolc rings, respectively, the curved 
line represents y-aminobutyric acid, and the diamond represents P-aianine. Single hydrogen 
bond mismatchL's are highlighted. 

As noted above, the sequences immediately flanking TATA boxes are unique to each 
30 individual gene with little or no homology between genes. The sequences immediately 5' and 
3* to the HlV-1 TATA clement are of the form 5'-\VGCWGCW-3' (where W = A o r T). A data 
base search revealed that at least one copy of the 5'-WGCWGCW-3' sequence is located 
adjaceni to the 1 llV-1 TATA clement in all sequences of the HIV-l LTR found in the EMBL 
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and GenBank data bases. Furlhermorc the 5'-TGCTGCATATAAGCAGCT-3' TATA element 
and the other prevalent HIV-1 TATA clement (5'-TGCTGCATAAAAGCAGCC-3') are found 
only in the various strains and isolates of HIV-1 and in no other reported sequence in either 
GenBank or the EMBL data base. 

5 

These polyamides were next used in DNase I footprinting experiments and gel mobility 
shift asseiys with TBP and an IIIV-1 LTR restriction fragment or TATA box oligonucleotides 
(as described above). As expected, the match polyamide HIV-l inhibited TBP binding to the 
HIV-1 TATA element but the mismatch polyamide HJV-2 did not inhibit TBP binding to this 

10 same DNA fragment (Figure 19A). DNase [ footprint titrations of polyamides HIV-1 and -2 
v^ere prefonned in the presence or absence of recombinant human TBP (rhTBP) (- or + TBP). 
A radiolabeled HIV-1 LTR restriction fragment was incubated for 30 min with the following 
concentrations of polyamides prior to addition of rhTBP where indicated: no polyamide, lanes 
2, 6, 10, 14; 2.5 nM polyamide, lanes 3, 7, 11, 15; 5 nM, lanes 4, 8, 12, 17; 10 nM, lanes 5, 9, 

15 13, 17. After an additional 30 min incubation, samples were digested with DNase I. G+A 
chemical sequencing reactions are shown in lanes 1 and 18. The extent of the footprints 
generated by polyamide HIV-l and TBP, respectively, are indicated at the sides of the 
autoradiogram. 

20 Complete inhibition of TBP binding was obsei-ved at a polyamide concentration of 2.5 

nM, which represents only a five-fold excess of polyamide over DNA binding sites in this 
assay. The HIV-1 polyamide did not inhibit TBP binding to an oligonucleotide corresponding 
to the TATA box region ot^ the adenovims major late promoter (data not shown). The half-life 
of the HIV-1 polyamide was determined in an experiment in which the polyamide-DNA 

25 complex was fust formed and then challenged with a large molar excess of unlabeled DNA. 
Samples were taken for digestion with DNase after various incubation times and the apparent 
half-life of the HIV-1 poiyamide-DNA complex is in excess ot^2.5 hours. 

I' igure 17 illustrates the binding of several classes of polyamides to the DNA sequence 
30 adjacent to the HIV-1 TA l A box (Figure 17, sections la-lc) and adjacent to the Ets-I / LEF-1 
binding sites. As described above, the polyamide lIlV-1 (Figure 17, la) is a hairpin molecule 
that binds to a seven base i^air sequence of the fomi S'-WGC \VGCW-3' (where W = A or T). 
A longer twelve base prnr sequence can be bound by an antiparallel dimer of the non-hairpin 
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polyamide IniPy-p-PyPyPyPy-p4mPy-13-Dp (Figure 17, lb). An even longer sixteen base pair 
sequence can be bound by an antiparallel dimer of the hairpin polyamide ImPy-p-ImPy-p- 
PyPyPyPy-(3-]mPy-P-lmPy-P-Dp (Figure 17, Ic). Similarly, a six base pair sequence between 
the Ets-l binding site ajul ihc LEF-1 binding site can be bound by the polyamide ImPyPyPy-y- 
5 ImPyPyPy-P-Dp (Figure 1 7, 2a). The same sequence plus a portion of the LEF-1 binding site 
extending a total of nine base pairs can be bound by the polyamide ImPy-P-ImPyPyPy-y- 
ImPyPyPy-p-PyPy-p-Dp (Figure 17, 2b). 

Example 4: 

10 Inhibition of TBP Binding by Polyamide Ligands 

Figure 18 schematically depicts the FIIV-1 promoter and DNA-binding sites for 
polyamide hairpins designed to target the HIV-1 promoter. In Figure 18A, the DNA binding 
sites for hairpin polyamides designed to target the HIV-1 enliancer and promoter are illustrated, 

15 showing the binding of the polyamide ImPyPyPy-y-ImPyPyPy-P-Dp, here designated 
polyamide 3, also labeled above as polyamide 1 in Figures 1-4 and 12-13. As before, in the 
binding model illustration, the shaded and unshaded circles represent imidazole (Im) and 
pyrrole (Py) rings, resj^eclively, the curved lines represent y-aminobutyric acid (y), and the 
diamonds represent p-alanine (P). The shaded bar schematic of the HIV-1 enhancer and 

20 promoter shows nucleotide positions -170 to the transcription start site at +1. The binding sites 
for the transcription factors USF, Ets-l, LEF-1, NF-kB, SPl and TFIID (TBP) are indicated. 
The binding sites for polyamides 1, called HIV-1 above, and 3, called HIV-l above, are 
indicated. The structures of polyamides ImPy-P-ImPy-y-ImPy-P-ImPy-P-Dp (1), Imlm-P- 
Imlm-y-PyPy-p-PyPy-p-Dp (2), ImPyPyPy-y-ImPyPyPy-p-Dp (3), and ImPyPyPy-y- 

25 PyPyPyPy-P-Dp (4), are shown in Figure 18 B, where Dp = dimethyaminopropylamide. 
Binding models and measured dissociation constants (K^/) arc shown. Mismatched base pairs 
are highlighted. 

ll is known that binding of the TBI^ subunit of TFIID in the minor groove nucleates 
30 assembly of tlic RNA polymerase II transcription machinery for TATA-containing genes. In 
general- S. K. Burley, R. G. Rocder describe the biochemistry and structural biology of the 
transcription factor TFIID Ann. Rev, Biorhem. 65, 769 (1096); C. P. Verrijzer, R. Tjian 
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describe TAFS which mediate transcription activity and promoter selectivity TIBS 21, 338 
(1996): R. G. Jloeder describes the role of general initiation factors in transcription by RNA 
polymerase 11 TIBS.. 21. 327 (1996). 

5 According to the pairing rules for DNA recognition, the 5'-WGCWGCW-3* sequence 

adjacent to the HIV-1 TATA element is targeted by hairjMii polyaniide 1 having sequence 
composition hnPy-P-lmPy-y-ImPy-P-hnPy-p-Dp (Figure 18). The imidazole-pyrrole pair 
(Im/Py) targets a G*C base pair and Py/Im targets C»G. Since the (i/p pairing recognizes both 
A*T and T«A base pairs polyamide 1 is expected to bind both the 5' and 3' flanking sequences 

10 of the lllV-1 TATA element (Figure ISA). White, S., Baird, E. E. & Dervan, P. B. describes 
the pairing rules for recognition in the minoi- groove of DNA by pyn*ole-imidazole polyamides. 
Chem. & BioL 4, 569-578 (1997); Swalley, S. E., Baird, E. E. & Dervan, P. B. describe the 
sequence specificity of the p/p pairing Cliem Eur. J. 3, 1600 (1997). Quantitative footprint 
titration experiments re\'eal that polyamide I binds both TBP sites as well as a 5'-AGCTGCA- 

15 3' match site overlappmg the binding site for Ets-1 with an equihbnum dissociation constant 
(K^/) of 0.05 nM (Figure 19A, lanes 3-5). For DNase I footprinting a 400--bp restriction 
fragment containing the 1 lLV-1 enhancer and promoter regions was isolated from pHIV LTR- 
CAT plasmid DNA (obianied from Dr, K. A. Jones of The Salk Institute, La Jolla, CA) (P.L. 
Sheridiui et aL. describe the sequence composition of the plasmid pHlV LTR-CAT Genes Dev. 

20 9, 2090 (1995)). Labeled DNA was incubated with polyamide (45 minutes) and digested with 
DNase 1 under single-hii conditions. 

Regions of polyamide protection were determined by analysis of the digestion products 
on a 6'lii denaturing poiyacrylaniide sequencing gel. Storage phosphorimage analysis of the 

25 data was used to obtain dissociation constants for the binding reactions (using Kodak Storage 
Phosphor Screens and a Molecular Dynamics SF Phosphorlmager). Recombinant human TBP 
(rhTBP) was obtained from Promcga and was used in footprinting and gel mobility shift 
experiments as recomjiicnded. The reaction mixtures also contained 100 ng of poly dG-poly 
dC per 10 uL gel shift icaction or 50 j.iL footprinting reaction. For gel mobility shift assays, 

30 6% polyacrylamide gels (20:1, acrylamidc to bisacrylamidc) contained 44 mM Tris-borate, pH 
8.3, 1 niM ED TA, 4 niM MgCb and 0.02% (v/v) NP-40. The same buffer was used for 

electropiioresis of 1 mni thick gels at 1 5 volis/cm for 2 hours at 4 *^C. 
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'The UlV-l TATA region double-stranded oligonucleotide, used as probe for the gel 
shift assay, had the top-strand sequence: 5'- 

CGTCCCTCAG ATGCTGCATATAAGCAGCTGCTTTTTGCCTGT ACTGGGTC-3 and the 
complementary bottom-strand sequence. The adenovirus major late promoter TATA region 
5 ohgonucleotide has the top-strand sequence 5'-GATCGGGGGCTATAAAAGGGGGTGGG- 
3\ and ihe complemeniai y bottom-strand sequence. Each DNA strand (50 ng) was labeled in 

32 

separate reactions with y- P-ATP and T4 polynucleotide kinase and the double-stranded 
oligonucleotides were prepared by annealing equal molar amounts of the two complementary 
strands. Gel shift reactions contained 15 fmol of doable stranded oligonucleotide (1.5 nM) 
10 and, where indicated in the figure legends, 50 pg of rhTBP (0.14 nM). Footprinting reactions 
contained the HIV-1 DNA probe (0.5 nM), and polyamide and 5 ng of rliTBP (3 nM) as 
indicated. 

The inliibition of TBP binding to the HIV-1 TATA element by polyamide 1 is shown in 
15 Figure 19. Figure 19A shows DNase I footprint titrations of polyamides 1 and 2 in the 
presence or absence of rhTBP. The radiolabeled HIV-1 LTR restriction fragment was 
incubated for 30 min w ith the following concentrations of polyamide 1 or 2 prior to addition of 
rhTBP w here indicated (+): no polyamide, lanes 2, 6, 10, 14; 2.5 nM polyamide, lanes 3, 7, 11, 
15; 5 nM, lanes 4, 8, 12, 16; 10 nM, lanes 5, 9, 13, 17. After an additional 30 min incubation, 
20 samples were digested wi(h DNase I. G+A chemical sequencing reactions are shown in lanes 1 
and IS. The location of tlie transcription start site, and the exieiit of the footprints generated by 
polyamide 1 and TBP (+1), are indicated at the sides of the autoradiogram. Binding sites for 
polyamide 1 are located adjacent to the TATA element and additional sites are found 
overlapping the binduig sue for the transcription factor Ets-1 (Figure 18A) and in the vector 
25 DNA sequence (upstream of position -170). 

A mismatch control polyamide Imlm-p-lmlm-y-PyPy-fi-PyPy-p-Dp (2) which differs 
only in ihe placement of imidazole and pyrrole amino acids binds the HTV-1 TATA box region 
with 100-fold reduced affinity relative to polyamide 1. TBP binds the HIV-1 TATA element 
30 with a of -^1-3 nM. DNase footprinting assays reveal that polyamide 1 inhibits TBP 
binding at a concentration of 2.5 nM, which represents only a five-fold excess of polyamide 
over DNA binding sites in this assay (Figure 19A., lane 7). Mismatch polyamide 2 fails to 
inhibit '! BP binding (Figure 19A., lanes 15-17), 
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A gel mobility shili assay whicli clisiinguishes the TBP-DNA complex from free DNA 
and polyamide-DNA complexes was carried out. The results are illustrated in Figure 19B. 

5 Radiolabeled HIV-1 l ATA box (lanes 1-5 and 11-15) or adenoviins major late promoter 
TATA box duplex oligonucleotides (lanes 6-10) (11) were incubated with the following 
concentrations of polyamide prior to addition of rhTBP, where mdicated (+): no polyamide, 
lanes 1, 2, 6, 7, II, 12; 50 nM polyamide, lanes 3, 8, 13; 100 nM, lanes 4, 9, 14; 200 nM, lanes 
5, 10, 15. After an additional 30 min incubation, samples were subjected to electrophoresis. 

10 The positions of free DNA probes (F) and the monomer and dimer TBP-DNA complexes (B) 
are indicated alongside the figure. Polyamide 1 inhibits TBP binding to a double stranded 
oligonucleotide corresponding to the HlV-1 TATA box region, while no inliibition is observed 
for control polyamide 2 (I igure 13B). Additionally, polyamide 1 does not inhibit TBP binding 
to the TATA box region of the adenovirus major late (AdML) promoter (5'- 

15 GGGGGCTAT A A AAGGGGGT-3 ') whic h contains mismatch (underlined) flanking 
sequences. The half-life of the polyamide 1-DNA complex was determined by competition 
experiments to be in excess of 2.5 hours. 

20 Example 5: 

inlitbition of LEF-1 binding. 

Polyaniides were designed and synthesized to target a sequence adjacent to the binding 
site for LEF-L a cellular iranscriptional activator used by HIV-1. LEF-1 is a member of the 
HMG family o( minor-groove binding proteins. J. J. Love, et al. describe the structure of the 

25 LEF-LDNA complex. Na/ure 376, 791 (1995); J. Kim, F. Gonzales-Scarano, S. Zeichner, J. 
Alwinc. Describe replication of HIV-l containing linker substitution mutations in the -201 to - 
130 region of the long terminal repeat J, Virol 67, 1658 (1993);. In addition to acting as an 
architectural transcription factor, LEF-1 possesses a strong //r//?5-activation domain and has 
been shown to he essential for viral transcription and replication in lymphoid cells. The LEF-1 

30 binding site is immediaicl\ flanked on one side by the sequence 5'-AGTACT-3'. According to 
the poh amide pairing rules, this sequence should be bound by hairpin polyamide 3 of sequence 
composition lmPyPyPy-y-[mPyPyPy-P-Dp which has already been characterized. Trauger, 
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J.W., Baird, E. E. Dcrvaii, P.B. describes the recognition of DNA by designed ligands at 
subnanomokir concentrations. Nature 382, 559-561 (1996). 

Quantitative footprint titration experiments reveal that polyamide 3 binds the 5*- 
AGTAC'T-3' site adjacent to the LEF-1 binding site with K^/ of about 0.06 mVI. (site P3-1 in 
Figure 20A). DNase 1 footprint titration experiments were performed as described for TBP 
above, fhree additional polyamide 3 landing sites present in the HlV-1 enhancer/promoter 
restriction fragment were bound with about 1 nM: 5'-TGTACT-3' (located at -7 to -2, P3-3 
in Figure 20A); 5'-AGATCT-3' (located at +20 to +25, P3-4 in Figure 20A) and a single base 
mismatch site 5'-TCTACA-3' (located at -106 to -111, P3-2 in Figure 20A). DNase I 
footprinting failed to detect any high affinity sites for mismatch polyamide 4 on the HIV-1 
enhancer/promoter fragment (Figure 20B), Three sites on the HIV-1 promoter/enhancer 
restriction fragment are bound by the 86-amino acid LEF-1 DNA binding domain (DBD), sites 
LI, L2, and L3 (Figure 20A) with Krf= L4, 5.8, and 4.9 nM, respectively. Recombinant LEF-I 
protein containing the 86 amino acid DNA-binding domain (DBD) was the generous gift of J. 
Love (Scripps Research histitute) and was expressed and purified as described. J. J. Love, et al, 
describe the stnicture of the LEF-1 -DN/\ complex. Nature 316, 791 (1995). LEF-1 was diluted 
into a buffer containing 10 niM Hepes-OH, pH 7.5, 100 mM KCl, 1 mM dithiotlireitol, 1 mM 
MgCh. 10% (v/v) glycerol and 250 [ag/niL BSA. 

DNA binding reactions were perforaied in the same buffer (without BSA) and also 
contained 250 ng of poly dG-poiy dC and the HIV-1 DNA probe at 50 pM in a final volume of 
50 ixL For assessing LEF-1 occupancy by DNase footprinting, the intensity of a protein- 
induced DNase I hypersensitive site was used as a hallmark of LEF-1 binding. The LEF-1 
footprint at site LI, characterized by a marked DNase I hypersensitive site (HSS), clearly 
changes to the polyamide footprint in ihe presence of match polyamide 3. LEF-1 binding is 
inhibited 50% at a polyamide 3 concentration of approximately 60 pM. Thus, polyamide 3 
located in the minor groove immediately adjacent to site LI inhibits LEF-1 DBD binding to 
this site. Polyamide 3 only inhibits LEF-1 binding to sites L2 and L3 at markedly higher 
polyamide concentrations (3 nM and above, lanes 13-14). Inhibition was obsei-ved either by 
adding the polyamide to the DNA before LEF-1 or after preincubation of the DNA with LEF-1, 
consistent with the relalixe dissociation constants for the two binding reactions. Mismatch 
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polyamidc 4, which buids to the 5'-AGTACT-3' sequence with >]00-folcl reduced affinity, 
fails to inhibit LEF-1 binding (Figure 20B & C). 



The inhibition of LEF-i binding to tiic HIV-1 enhancer by polyamide 3 is shown in 

5 Figure 20. hi Figure 2()A, the S'-^-'P end-labeled HIV-l LTR restriction fragment was 
incubated wnth 1 0 pM - 1 0 aM polyamide I for 45 min prior to the addition of LEF-1 DBD to a 
final concentration of S niVI (lanes 7 to 14). After an additional 45 min, the samples were 
subjected to DNase 1 digestion and analyzed by gel electrophoresis. The polyamide 
concen nations were: no polyamide (lanes 1 and 7); 10 pM (lanes 2 and 8); 30 pM (lane 9); 0.1 

10 nM (lanes 3 and 10); 0.3 nWl (lane 11); 1 nM (lanes 4 and 12); 3 nM (lanes 5 and 13); 10 nM 
(lanes (> and 14). In Figure 20 B, mismatch polyamide 4 does not bind to the the HIV-1 
enhancer/promoter. The labeled DNA was incubated with the following concentrations of 
polyanndes for 45 min; no polyamide (lane 1), 10 nM polyamide 3 (lane 2), 1, 3, 10, and 60 
nM polyamide 4 (lanes 3-6). In Figure 20C, mismatch polyamide 4 does not inhibit LEF-1 

15 binding to the I IlV-l enhancer/promoter. The labeled DNA was incubated with the following 
concentrations of polyamide 4 ibr 45 min prior to the addition of LEF-1 DBD: 1 nM (lanes 1); 
3 nM (lanes 2); 10 nM (lanes 3); and, 30 nM (lanes 4). LEF-1 DBD was added to a final 
concentration of 8 nM and, after an additional 45 min incubation, the samples were subjected 
to digestion with DNase L The location ofpolyannde sites P?-l to P3-4, LEF-1 sites LI to L3, 

20 the LEF-1 HSS site and the start-site for transcription (+1) are shown alongside the 
autoraclioizram. 



Example 6: 

25 Inhibition of RN A Polymerase II Transcription 

b> Synthetic DNA-Binding Lifjunds 

The effects of polyamides 1 and 2 on HIV-1 transcription were tested in an in vitro 
transcription assay with a HeLa cell nuclear extract (Figure 21A). The effects of the 
30 polyamides on basal transcription by RN A polymerase 11 was monitored with a nuclear extract 
prepared irom I fcLa cells (.1. D. Dignani, P. L. Martin, B. S. Shastry, R. G. Roeder, Describe 
eukai70tic uene transcription w ith purified components Meth. Enzynioi lOL 582 (1983)). The 
human lymphoid cell line H9 (ATCC MTB 176) was grown in suspension culture in RPMl 
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medium (Bio Whittakcr) supplemented with 10% fetal calf scrum (Tissue Culture Biological). 
Whole cell extracts of H9 cells in log phase growth were prepared by hypotonic lysis and 
contained 20 mg/ml protein, as deteimijicd by the Bradford reaction. Run-off RNA transcripts 
of -^300 bases (CMV MTIZP linked to a guanine-less cassette, plasmid pElB-GL and -500 
bases (pinv LTR-CAT) were obtained with EcoRI-digested plasmid DNA (100 ng per 25 ^iL 
reaction). 

Polyamide-DNA complexes were allowed to fonii at ambient temperature for 30 min 
prior to addition of the nuclear extract. Transcription complexes were allowed to form for 1 h 

at 30 prior to a transcription step for 1 h at 30 with 10 )aCi of a--^^P-ATP, 10 \iM 
unlabeled ATP, and 600 liM oI the remaining unlabeled nucleoside triphosphates. RNA was 
purified b\ extraction with RNAz,ol (TelTest, Friendsvvood, TX) and analyzed by 
electrophoresis on a denaturing (8.3 M urea) 6% polyacrylamide gel. Auto radiograms were 
obtained by exposure of the dried gel to Kodak BioMax film with DuPont Cronex Lightning 

Plus intensifying screens for 1 to 18 h at -80^C. Relative levels of transcription were estimated 
by storage phosphorimage analysis. 

Polyamide 1, which targets the 1 llV-1 TATA box, inhibits basal transcription from the 
HIV-1 promoter mediated by the general transcription factors SPl and RNA polymerase II 
(Figure 21 A, lanes 2-4). HIV-1 transcription was inhibited 50% in the presence of between 50 
and 100 nM polyamide 1 in several independent experiments. Polyamide 1 docs not inhibit 
transcription from the CMV major intermediate early promoter (MIEP), which contains a 
mismatched Ty\TA-flanknig sequence (5'-GAGGTCTATATAAGCAGA-3')- The mismatch 
polyamide 2 does not iiihibit transcription from either promoter (Figure 21 A). Titrations were 
performed of polyamide 1 over a wide range of concentrations (1 to 200 nM) with both the 
HIV-1 and CMV^ templates in the same reaction. Under these conditions, 50% inhibition of 
HIV-1 transcription was observed at 30 nM polyamide 1, which corresponds to a 3-5 fold 
excess of polyamide over binding sites. Ko inhibition of CMV transcription was observed even 
at the highest concentrations of polyamide 1 (Figure 21B). Polyamide 1, which has an 
additional binding site in the HlV-1 enhancer at nucleotide positions -153 to -147 (overlapping 
the Ets-I site lU -149 lo -142), inhibits the DNA binding activity of the Ets-1 transcription 
factor. 
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'Yhc efiects of polyamiclcs 3 jikI 4 on HIY-1 transcription in an /// vitro system 
consisting of a cell-free extract prepared from cultured human lymphoid H9 cells supplemented 
with HeLa cell extract w crc determined. The H9 extract contains high levels of LEF-1 protein 
but was found to support only low levels of transcription, suggesting a limited amount of other 
5 transcription components in this extraci. The H9 cell extract was supplemented with small 
amounts of a HeLa cell-derived nuclear extract in order to obtain high levels of transcription 
(Figure 21C, lane I). The K9 HeLa cell extract stimulates IIlV-l transcription 2.5-3-fold over 
the level of transcription observed with the HeLa extract alone. Immunodepletion of LEF-1 
protein from the H9 extract abolishes this activated transcription (Figure 21C, compare lanes 1 
10 and 6). 

For immunodepletion and western blot analysis the H9 cell extract was depleted of 
LEF-1 protein with antibody to L.EF-l pre-bound to protein A sepharose beads. A mixture of 5 
^1 of antisei uni and 50 [tl of a 1:1 (v/v) slurry of protein A sepharose in transcription buffer 
15 supplemented with 2.5 nig/ml bovine serum albumin (BSA) was incubated on a rotator at 4 

for 1 hour. For mock immunodepletion, an equivalent volume of beads was treated identically 
without antibody. Unbound antibody and BSA were removed by brief centrifugation and the 
beads were washed thice times with 50 \.[\ of transcription buffer. For immunodepletion, the 
packed protein A beads were mcubalcd with 10 |li1 of H9 cell extract, 45 |.d of transcription 

20 buffer and 7.5 id of 50 miVI iMgCb on a rotator at 4 lor 1 hour. The beads were then 
pelleted by brief centrifugation and the supernatant was tested for transcription activity. The 
efficiency of immunodepletion was determined by subjecting the depleted and mock-depleted 
extracts to SDS-PAGE and western blotting and was found to be greater than 95 %. The blot 
was probed with antibody to LEF-1 (diluted 1:2500, kindly provided by K. Jones, Salk 

25 Institute. La Jolta, California) and detected by enhanced chcmiluminescence (Amersham ECL 
kit). Kodak Bio-Max film was used for detection. 

Pol>'amide 3 inhibits II iV-] transcription in this system (lanes 2-5) with a 50% 
reduction of transcription ooservcd at 10-30 nM polyamide. Polyamidc 3 fails to inhibit HIV-1 
30 transcription in the LEF- 1 -depicted extract (lanes 7-10). The activity of the CMV MIEP was 
observed in both the mock-depleted and LEF-1 depleted H9 cell extract, with the result that 
LEF-1 depletion had no effect on CMV transcription. The effect of the mismatch polyamide 4 
on HlV-1 transcription (J-iguie 21 D. lanes 4 and 5) and polyanndes 3 and 4 on CMV 
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transcriplioii (Figure 2 ID, lanes 6-10) were examined as addilional controls. No potential 
binding sites for either polyamide 3 or 4 arc present in the CMV MIEP sequence. As expected, 
polyamidc 3 fails to inhibit CMV transcription (Figure 2 ID, lanes 7-8). The mismatch 
polyamide 4 fails to inhibit either HIV-1 or CMV transcription (lanes 4-5 and 9-10, 
5 respectively). 

In Figure 21 A the inhibition of basal transcription with polyamide 1 is demonstrated. DNA 
templates containing the HIV- 1 promoter (lanes 1-4, 9-12) and the CMV major immediate early 
promoter (hmes 5-8, 13-16) were incubated with the following concentrations of polyamide 1 

10 (lanes 2-4, 6-S) or polyamide 2 (10-12, 14-16) prior to the addition of a HeLa cell nuclear extract 
(16): no polyamide, lanes 1, 5, 9, 13; 50 nM polyamide, lanes 2, 6, 10, 14; 100 nM, lanes 3, 7, 11, 
15; 200 nM, lanes 4, 8. 12, 16. In Figure 21B the relative levels of HIV-1 transcription/CMV 
transcri[nion are plotteci as a function of polyamide 1 concentration. Data were obtained from 
mixed template reactions containing both DNA templates as described in Figure 21A. In Figure 

15 21 C the inhibition of polyamide 3 on LEF-1 -activated transcription is shown. Transcription 
reactions were performed with EcoRI-digested plasmid pHlV LTR-CAT and the H9 whole cell 
extract (2 ul) supplemented with 1 of the HeLa nuclear extract (lanes 1-5) or with 2 |xl of the 
LEF-dej^leted H9 extract and the HeLa extract (lanes 6-10). Plasmid DNA was incubated with 
polyamide 3 for 15 min pviov to addition of cell extracts and other reaction components. The final 

20 concentrations of polyamide in the reaction were: no polyamide (lanes 1 and 6), 10 nM (lanes 2 
and 7), 30 nVT (lanes 3 and 8), 100 nM (lanes 4 and 9) and 300 uM (lanes 5 and 10). In Figure 21 
d, transeripiion with mismatch control polyamide 4 and a control CMV template. Plasmid DNA 
was incubated with polyamide 3 (10 nM, lanes 2 and 7; 100 nM, lanes 3 and 8) or polyamide 4 (10 
nM, lanes 4 and 9; 100 nM, lanes 5 and 10) for 15 min at ambient temperature prior to addition of 

25 cell extracts and other reaction components. 

Notably, polyanifde 3 does not inhibit basal transcription with the HeLa nuclear extract 
although binding sites for this polyamide are present at the start-site for transcription and 
within the HfV-l mRN,\ coding sequence present in plasmid pHTV-CAT, These observations 
30 suggest that RNA polymerase 11 can transcribe DNA witli a polyamide bound in the minor 
groove and that polyamides arc only inhibitory to transcription when these compounds 
interfere with the DNA binding activity of a required transcription factor. 
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Example 7: 
Iiihibllion of RNA Pol II Transcription in 
Iluinan Cells by Synthetic DNA-Biutling Ligands 

5 The polyamicles of the present invention have been demonstrated to inhibit HIV-1 

transcription in viriis-iniccted human cells. In general, M. E. Klotman, S. Kim, A. Buchbinder, 
A. DeRossi, D. Baltimore, F. Wong-Staal describe the kinetics of expression of multiply 
spliced RNA in early I l]\/-1 infection of lymphocytes and monocytes, Proc. Natl Acad. ScL 
U.S.A, 88, 5011-5015 (1991). Since there are multiple spliced and unspliced species of HIV-1 

10 RNA with different turn-over kinetics, viral transcription was not monitired by measuring 
levels of RNA. Insteail, tlie effects of the polyamides on the levels and kinetics of HIV-1 
replication in isolated luiman peripheral blood mononuclear cells (PBMC) was observed in 
culture. PBMC were infected with the T cell-tropic HlV-1 strain WEAU1.6, or the 
macrophage-tropic strain SF162. S. J. Clark et al describe high liters of cytopathic virus in 

15 plasma of patients with symptoms of primary human imunodeficiency virus type-1 infection. 
A^. Engl, J. MecL 324, ^>54 (1991); P. Borrow, et al., describe antiviral pressure exerted by 
HIV-1 specific cytotoxic T-lymphocytes (CTLS) during primary infection demonstrated by 
rapid selection ofCTL escape virus Nature Med 3, 205 (1997); C. Cheng-Mayer, D. Seto, M. 
Tateno, J. A. Levy describe biologic features of HIV-l that correlate with virulence in the 

20 hosi.Sciencc 240, 80 (19SS); C. Cheng-Mayer, C. Weiss, D. Seto, J. A. Levy, Describe isolates 
of HTV-1 from the brain may not constitute a special group of the AIDS virus. Proc. Nad. 
Acad. Sci. USA 86, 8575 (1989). Polyamides were added to the culture medium and the levels 
of HIV-1 p24 viral capsid protein in the culture media wore determined on subsequent days 
using a standard ELISA assay. 

25 

Human PBMC were separated from whole blood collected from normal adult 
volunteers by density gradient centrifugation as described. D. E. Mosicr, et al, describe HIV-1 
infection of human-PBL-SClD mice Science 251, 791 (1991); D. Mosier, R. Gulizia, P. 
Maclsaac, B. Torbett, ,1. Levy, describe rapid loss of CD4+ T-cells in human-PBL-SCID mice 
30 by noncylopathic HIV isolates Science. 260, 689 (1993). Donors were provided by the General 
Clinical Research Center of The Scripps Research Institute, which is supported by NIH grant 
MOl RR00833. Human PBMC were activated with 2 ^g/mL PllA and 20 units/mL of IL-2 

for 2-3 days pnor to Hi\'-1 infection. Eacti culture of 5 x 10^ PBMC was infected with 10^ 
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tissue culture infectious doses of HIV- 1 for 24 hours. Free virus was removed by washing the 
cells in medium, and poiyamides added to the culture. In the experiment shown in Figure 22C, 
polyamidcs were added 24 hours prior to vims exposure, and were continuously present 
thereafter. Vh'us replication in culture was measured by HIV-1 p24 viral capsid antigen 
5 ELISA (DuPont Medical Products, Boston, MA). 

Assays of HIV-1 replication representative of five replicate experiments with five 
human PBL donors are shown in Figures 22 and 23, illustrating results from two experiments 
employing different dojiors. Each experiment is controlled by using poiyamides which differ 
10 by single atomic substiuuion from the inliibitory poiyamides. In control PBMC cultures with 
no added polyamidc, viral replication resulted in increasing p24 levels between day 4 and day 
10 of culture (Figure 23 ). Addition of mismatch control poiyamides 2 and 4 had no effect on 
the level of virus in the medium, either alone (Figure 22, A, B, & D) or in combination (Figure 
22C). 

15 

la contrast, both poiyamides 1 and 3 added to the culture medium reduced p24 levels in 
a dose-dependent manner (Figure 22A & B). Polyamide 1 at 1 \.iM concentration caused an 
80% reduction in virus ( Figure 22 A), while polyamide 3 at 10 |.iM concentration caused a 60% 
reduction (Figure 22B). These inliibitory effects were clear at 6 or S days of culture, but 
20 became less pronounced at later times. The two polyamide ligands, when used individually, 
delay the appearance of virus rather than absolutely inhibit virus production. 

The combination of poiyamides 1 and 3 at 1 pM each can act in synergy to reduce viral 
p24 levels to below the threshold of detection (<10 pg/ml; greater than 99.9% inhibition of 
25 viral replication) (Figure 22D, 23A & C), and were as effective as 1 laM azidothymidine (AZT) 
in blocking HIV-1 replication (Figure 22D). The macrophage-tropic SF162 isolate, which 

replicates in both macrophages and CD4'^~ T lymphocytes, was more difficult to inhibit with 
single poiyamides, but ihc combination of 1 \.iM poiyamides 1 and 3 was able to reduce and 
eventually block its replication (Figure 23A). These results demonstrate that cell-permeable 
30 synthetic DNA hgands can effectively inliibit FllV-1 replication in isolated human 
lymphocytes in culture and suggest that inhibition of virus replication is due to inhibition of 
transcription factor-DNW interactions and gene transcription of viral and possibly cellular 
genes by RNA polymerase II. The observed polyamide inliibition of \'irus replication is likely 
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due to interference wiili ilic DNA binding activities of TBP and Ets-1 by polyamide 1 and the 
binding aciivity of LEF-l by polyamide 3. The inhibitory effects of polyamides singly or in 
combination was not ditc to obvious toxicity. No significant decrease in cell viability was 
apparent in PBMC cultures treated with polyamides 1 and 3 for 10 days, in contrast to 90% 
5 mortality observed for PBMC cells treated with 1 \iM AZT for the same period (Figure 23B & 
D). Cell viability was not impacted by polyamide treatment, but was reduced by AZT 
treatment. 

The combination of polyamides 1+3 inliibited HlV-1 replication, but the closely 
10 related polyamides 2 + 4 did not. The simplest explanation for this finding is that polyamides 
1 + 3 inhibited HIV-1 RNA transcription in cells as well as in the in vitro assays (Figure 22), 
but it is possible that inhibition of cellular genes involved in T cell activation could result in an 
indirect effect on HIV-1 replication. To further assess this possibility, a sensitive RNAase 
protection assay was performed for transcripts of a number of cytokine genes, including IL-2, 
15 IL-5, and IL-13 which differ by only single base mismatches from the target sequences 
flanking the TATA box m the FIIV-1 LTR. Four other cytokine genes that lack binding sites 
for either polyamide 1 or 3 in their promoters were also examined (Figure 24A). The results 
(Figure 24B) show thr.i exposure of activated human PBMC to a combination of either 
polyamides 1 + 3 or 2 + 4 (1 uM each) for 6 days failed to inhibit cytokine RNA expression. 
20 This lack of inhibition of cytokme gene transcription suggests that the polyamides reduce virus 
replication in cells by a direct effect on HlV-1 RNA transcription. 

Our present studies have utilized polyamides designed to target DNA sequences 6-7 bp 
in length and have shown that these compounds are effective inhibitors of gene transcription in 

25 cell-free systems and, viihi Infra, in human cells. Because sequences of these lengths would be 
highly redundant in the human genome, it had seemed likely that these ligands would have 
deleterious effects on cell metabolism due to interference with the activity of cellular genes. 
However, the results described here indicate that a battery of polyamides which recognize 6-7 
bp sequences will be suificient for gene-specific regulation /// vivo. It is interesting to compare 

30 these small molecule uanscription factors to eukaryotic transcriptional regulatory proteins 
which also recognize multiple sequences of similar length in tandem in order to increase 
functional specificity. '1 ae observations that polyamides do not mtcrfere with pol II elongation, 
and that i^olyamides c^iii bind simultaneously with certain major groove binding proteins 
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should liirther enhance geue-spccificity. M. G. Oakley, M. Mrksich, P, B. Dervan describe 
simultaneous binding of polyamide in the minor groove wiih a protein in the major groove. 
Biochemistry 10969 (1992). 

5 As shown in Figure 16, polyamides are not limited to 6-7 base-pair recognition, but can 

bind as cooperative diineis to sequences up to sixteen base pairs in length. Trauger, J.W., 
Baird, E. E. Mrksich, M., Dervan, P.B. describes the recognition of 9-13 base pairs of DNA by 
a p-alaninc linked extended polyamide dimer. 

10 The specific inhibition of genes transcribed by RNA polymerase II represents an 

important first step toward asking whether cell-permeable small molecule transcription 
antagonists might regulate gene expression in complex organisms. TBP and sequences 
adjacent to the TATA element for inhibition of basal transcription by RNA polymerase II were 
chosen. Siiice most tissue-specific cellular genes and viral genes contain TATA elements, this 

15 approach is generally applicable for the inhibition of most target genes. 

The inhibition of HIV-1 replication in peripheral blood mononuclear cells by 
polyamides is shown in Figure 22. Panels A-D depict three separate experiments in which 
polyamides alone (A, B) or in combination (C, D) were added to cultures of human peripheral 

20 blood mononuclear ceils (PBMC) stimulated 3 days earlier with phytohemaggiutinin and 
interleukin-2. PBMC cultures were infected with the primary HIV-1 isolate WEAU 1.6 
(kindly provided by G. Shaw), and virus replication measured by release of p24 capsid antigen 
into the medium that was detected by antibody-capture ELISA (Dupont). Each experiment 
involved a separate PBMC donor. Values shown are for day 6 or day 8 after virus infection. 

25 When two polyamides were combined, the concentration shown is for each component of the 
mixture. In panel D, the combinadon of 1 |.iM polyamide 1 -H 1 |li1VI polyamide 3 cooperatively 
blocked virus infection and p24 release (below 10 pg/ml, the detection limit of the assay), as 
did the addition of I |iM azidothymidine (AZT). Assays of p24 concentration were performed 
in duplicate and showed less than 5% variation from the mean value reported, 

30 

Figure 23 shows the kinetics of the inhibition of HIV-1 replication and effects on cell 
viability by a combinativui of polyamides added to PBMC cultures. PBMC were isolated as in 
Figure 22 and infected cither with the macrophage-tropic FIiV-1 isolate SF162 (panels A and 
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B) or the T-cell tropic isolate VVEAU 1.6 (panels C and D). Polyamides 1 atid 3 were added in 
combination at 1 |iM each. Although each polyamidc alone was not very effective in blocking 
replication of HIV-1 Sri62 (data not shown), the combination of both polyamides was able to 
stabilize and later reduce virus p24 production to undetectable levels (panel A, filled triangles) 
5 while unueated cultures (open triangles) continued viral replication. Cell viability was 
detemiined at days 4 and 10 of culture by trypan blue exclusion. 

The combination of polyamides 1 and 3 caused a small decrease in viability compared 
to cultures that were untreated (panel B, filled versus open triangles), but parallel cultures 

10 treated with 1 ]xM azidothymidine (AZT) showed a much more severe decline in cell viability. 
AZT completely inhibited virus replication of both HIV-1 SFI62 and WEAU 1.6 at this 
concentration (data not shown). In contrast to these results, each polyamidc alone partially 
inhibited replication oi HIV-1 WEAU 1.6 (see Figure 23 A & B), and the combination of 
polyamides 1 and 3 blocked replication at all times examined (panel C, filled circles) compared 

15 to untreated cultures (open circles). Cell viability was higher in cultures treated with 
polyamides 1 and 3 than in untreated cultures (panel D, filled circles versus open circles), 
probably because the cytopathic effect of HIV-1 infection was completely reversed. AZT 
again showed substantial toxicity for cells. Assays of p24 concentration were performed in 
duplicate and showed loss than 5% variation from the mean value reported. Similar results 

20 were obtained in two additional experiments. 

A lack of inhibinon of cytokine gene expression with polyamides targeted to the HIV-1 
promoter and enhancer sequence can be seen in Figure 24. In Figure 24A the sequences of 
the TATA-box region (taken from GenBank listings) of each of the cytokine/growth factor 

25 genes examined are show n with the TATA box in bold and the binding site for polyamide 1 
underlined. Single base mismatches are indicated in lower case. In Figure 24B, human 
peripheral blood mononuclear cells were cultured under the same conditions used for HIV-l 
infection. Cultures were cither left untreated, or 10 pM of polyamides 1 + 3 or polyamides 2 + 
4 added. After six d:iys. cells were harvested, total RNA extracted, and a ribonuclease 

30 protection assay performed with riboprobes specific for the indicated cytokines as well as for 
CD4, CDS and ribosomal protein L32. M. V. Hobbs, et ah, describe pattems of cytokine gene- 
expression by CD4+ T-ce!ls from young and old mice J. J/nniiinol 150, 3602 (1993). After 
digestion with RNAse Tl, protected fragments were separated by polyacrylamide gel 
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electrophoresis, and Ihu amount of labeled probe qiiantitated by Phosphorlniager analysis. 
Data are expressed as the intensity of each cytokine RNA relative to the intensity of the 
ribosomal L32 RNA band to standardize for RNA loading. There was no difference in the 
intensity of CD4 and CDS RNA bands between groups, indicating equivalent recovery of CD4 
5 and CDS T cells in ali cultures. Similar results were seen when the cultured cells were 
analyzed after 10 days, although tlie levels of RNA for most cytokines had declined in both 
polyamide treated and untreated cells. 



10 

Example 8: 
Differential Inhibition of Ets-1 
Binding With Two Polyamides 

Ets-I is another ceHular transcription factor required for high levels of HIV- 1 RNA 
15 synthesis (Figure 25). Figure 25 shows the schematic structures of polyamides targeted to the 
Ets-1 /LEF-1 region of the HlV-1 enliancer and the sequence of the binding sites for these 
polyamides. Polyamides I and 2 have been described above in the context of the LEF-1 
inhibition experiments above . Polyamide 3 (ImPy-p-ImPy-y-ImPy-p-lmPy-p-Dp) binds the 
sequence 5'-TGCTGCA-3' with a K,i of 0.05 nM, while the mismatch polyamide 4 (Imlm-p- 
20 ImIm-y-PyPy-P-PyPy-|^Dp) exhibits 100 fold lower affinity for binding that site. The binding 
sites for polyamides 1 and 3 are located immediately downstream (polyamide 1) and upstream 
(polyamide 3) of the Eis-1 DNA recognition sequence in the HIV-1 enliancer region (Figure 
25B). Polyamide 3 also recognizes sequences immediately flanking the TATA box of the 
HIV-1 projnoter, and was shown to inhibit DNA-binding of TBP and prevent basal 
25 transcription from this promoter. 



Schematic representation of the HIV-1 enhancer/promoter region with the DNA- 
binding sites for the transcription factors USF, J£ts-1, LEF-1, NF-kappaB, Spl and the TATA- 
binding protein TBP. The positions of the polyamide binding sites are indicated by vertical 
30 arrows. The DNA-binding sites for polyanudes I and 3 flanking the Ets-1 recognition site are 
shown in detail below, fhc Ets-l binding site is boxed, the GGA core recognition sequence, 
that is conserved in most Cts-protein binding sites, is indicated by a small rectangle. The LEF-1 
binding site is underlined. 
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Since the Ets-1 l:iinding site is flanked by and partially overlaps with both polyamide 1 
and polyamide 3 binding sites, whether Ets-1 binding could be inliibited by either one, or both 
of these two polyamides was tested by gel mobility shift experiments. When polyamides were 

5 preincubated with a radiolabeled double-stranded HIV oligonucleotide before adding Ets-1 
AN331, polyamide 1 had no effect on Ets-l DNA-binding, even al a concentration as high as 
400 nM (Figure 26 lanes 3-5). Polyamide 3, however, prevented the Ets-1 /DNA complex 
fom^ation (Figure 26, lanes 9-11). When both polyamides were combined, the degree of 
inhibition was very similar to the one observed with polyamide 3 alone (Figure 26, lanes 15- 

10 17), The two mismatch polyamides 2 and 4 did not prevent complex formation, either alone or 
combined (Figure 26, lanes 6-8, 12-14 and 18-20). A titration experiment revealed that 
polyamide 3 inhibited Bts-1 AN331/DNA complex fonnation by 50% al a concentration of 
approximately 6 nM, nearly complete inhibition was achieved between 50 and 200 nM 
polyamide. Mismatch polyamide 4 had virtually no effect in the sanie concentration range. 

15 

A double-stranded, labeled oligonucleotide corresponding to the Ets-1 binding site with 
flanking regions within Uie HIV- 1 enhancer/promoter was used as a probe in gel mobility shift 
assays. Polyamides were preincubated with the DNA before addition of Ets-1, where indicated. 
The concentrations of polyamides were 100 nM (lanes 3, 6, 9, 12, 15 and IS), 200 nM (lanes 4, 

20 7, 10, 13, 16 and 19), md 400 nM (lanes 5, 8, 11, 14, 17 and 20). AN331 protein was added at 
a concentraiion of 12 nM. Positions of the free probe (F) and bound probe (B) are indicated. 
Figure 27A shows a representation of an autoradiogram of a representative gel mobility shift 
assay showing the inhibitory effect of polyamide 3 on AN331 binding. The concentration of 
AN331 was constant (12 nM in lanes 2-10), the probe concentration was 50 pM. The 

25 polyamide was preincubated for 20 min at roon^ temperature before addition of the protein, 
followed by an additional 20-30 min incubation on ice. The polyamide concentrations were 
1.56, 3-125, 6.25, 12.5, 25, 50, 100, 200 niVI in lanes 3 to 10, respectively. Figure 27B is a 
graphical vcpresentaiion of the decrease of the fraction of bound probe as a function of 
polyamide concentration. Closed squares represent the data points obtained for polyamide 3, 

30 open squares represent mismatch polyamide 4. Figure 27C is a graphical repiesentation of 
the effect of polyamide 3 on Ets-1 binding when added at different time points during the 
binding reaction. Polyamide 3 was added to the probe either prior to addition of Ets-1 (closed 
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squares), or after addiiion of Ets-1 (open squares), or simultaneously with Ets-1 (closed 
triangles). 

Figure 28 is a representation of the results of DNase I footprint titration experiment 
with polyainides 1 and 3 in the absence (-) or presence (+) of 9.6 nM AN331 protein. 
Polyamides were incubated with the radiolabeled probe prior to addition of Ets-1. The 
polyamide concentrations were 0 nM (lanes 2, 6, 10, 14), 4 nM (hines 3, 7, 11, 15), 20 nM 
(lanes 4. 8, 12, 16) and 100 nM (lanes 5, 9, 13, 17). Lanes 1 and 18 show a G-t A sequencing 
ladder. The regions protected by polyamides and by Ets-1 are indicated alongside the 
sequencing ladder. Note the two DNase I hypersensitive sites characteristic of the Ets-1 
footprint. 

Polyamide 3 prevents Ets-1 DNA-binding while polyamide I coexists with Ets-1 on 
overlapping DNA binding sites: 

A labeled DNA fragment derived from the HIV-1 enhancer was incubated with each 
polyamide either alone, or followed by addition of Ets-1 and a further 30 minute incubation. As 
expected, both polyamides bound their target site with similar affinities, polyamide 1 even 
bound with a slightly higher affinity than polyamide 3, complete protection was observed with 
20 nM polyamide 1, and with 100 nM polyamide 3 (Figure 28, lanes 3-5 and 11-13). The Ets-1 
footprint is characterized by two DNase I hypersensitive bands appearing in the center and at 
the 5' boundary of the lootprint. These hypersensitive sites disappear with the addition of 20 to 
100 nM polyamide 3 (Figure 28, lanes 7-9), but remain unchanged with the addition of 
polyamide 1 (Figure 28, lanes 15-17). The simultaneous presence of polyamide 1 and Ets-1 
results in a broadening of the footprint, corresponding to a combined footprint created by Ets-1 
and polyamide 1 (Figure 28, lanes 14-17), while the Ets-1 footprint is replaced by the 
polyamide 3 footprint (lanes 7-9). Based on the structure of the Ets family protein PU.l in 
complex with its DNA recognition element, it has been proposed that DNA recognition is 
mediated by both major groove and minor groove contacts. In the case of Ets- U minor groove 
contacts would be expected to occur at the 5' end of the recognition clement. This structural 
predicnon is in complcic agreement v/ith our polyamide inliibition experiments. A polyamide 
located at the 5' end of the Ets-1 site is inhibitory lo DNA binding whereas a polyamide 
located at the 3' end of the Ets-1 site is not inhibitory to DNA binding. 
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Example 9: 

Iiiliibitioa of TBP Bitidiiig and Basal Transcription with Poh amides Designed to Bind 

Sequences Adjacent to the TATA Element 

5 

The previous Examples demonstrate that polyamides which bind sequences adjacent to 
the well-consei-ved TATA sequence upstream from the transcription start-site of messenger 
RNA-coding genes inhibit TBP binding and basal transcription by RNA polymerase II. This 
method is a useful general approach for modulation of gene expression. For example, specific 

10 polyamides have been designed and synthesized that bind to double stranded DNA adjacent to 
the TATA element of the human cytomegalovirus major immediate early promoter (CMV 
MIEP). Polyamide CMV-1, ImlmPyPy-yTmPyPyPy-P-Dp, binds the identified target 
sequence 5'-AGGTCT-3\ where the last T of this sequence is the 5' T of the TATA element. 
Polyamide CMV-1 binds the CMV MIEP with an apparent dissociation constant of 1 nM and 

15 is an effective inhibitor of TBP binding and basal transcription. Appropriate mismatch 
polyamides do not inhibit either I'BP binding or transcription. 

In another example, a polyamide was designed to bind immediately downstream of the 
TATA element found in the human Hcr-2/neu breast cancer oncogene promoter. This 

20 polyamide, Her2-1, of composition ImPy-p-Pylm-y-PyPy-p-PyPy-p-Dp, binds the sequence 
5'AGAATGA-3' (where the 5' A of this sequence is the 3' A of the TATA element) with an 
apparent dissociation constant oF 20 pM and is an cffectixe inhibitor of TBP binding and 
transcription. Her-2/neu is recognized to be ovcr-cxpresscd in several cancers, including 
human gynecologic adenocarcinomas, including those of the ovary, endometrium, breast, 

25 fallopian tube and cerx ix (Cirisano, F.D., & Karlan, B.Y., 7. Soc. Gynecol. Investig. 3 99-105 
(1996)). 

These additioiv.d data confirm the generality of this approach and demonstrate that 
polyamides located ciiher upstream or downstream of the TATA element are effective 
30 transcription inhibitors. Table 3, below, lists the gene promoters, TATA sequences and 
composition of the poK amidcs which have been shown to be inhibitors of TBP binding and 
transcription. 
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Table 3 




Polyamide Inhibition of TBP Binding and Basal Transcription by Targeting 




Sequences Adjacent to the TATA Element 


GENE 


TATA SEQUENCE 


POLYAMIDE 


Artificial 
TATA 


5 ' -TATAAGT ACTT-3 ' 


ImPyPyPy-Y-ImPyPyPy-3-Dp 


HIV-1 
Promoter 


5 ' -TGCTGCATATAA-3 ' 
5 ' -TATAAGC AGCT-3 ' 


ImPy-p-PyIm-Y-PyPy-|i-PyPy-P-Dp 


CMV 
MIEP 


5'-AGGTCTATAA-3' 


ImlmPy Py-y-ImPy PyPy- P-Dp 


Her-2/neu 
oncogene 
(Breast Cancer) 


5 ' -TATAAGAATGA-3 ' 


ImPy-P-Pylm-y-PyPy-P-PyPy-p-Dp 



Example 10: 

Anti-repression of Polymerase II Transcription by a Designed Ligand: 

5 

Pyrrole-imidazole polyamides can exert a positive effect on transcription by interfering 
with the activity of a specific repressor protein. The human cytomegalovirus (HCMV) IE86 
repressor protein is well suited for this study, because transcriptional repression is dependent 
on IE86 binding to its DNA target site. IE86 negatively regulates the major immediate early 

10 promoter (MIEP) of HCMV by binding to a sequence element (the cis repression signal, crs) 
located between the TATA box and the start of transcription. It was shown that IE86 exerts its 
negative effect on transcription by binding to the crs element, thereby blocking recruitment of 
RNA polymerase 11. The polyamides that specifically recognize the crs, prevent IE86 from 
binding and relieve transcriptional repression, while a mismatch polyamide or a polyamide 

15 which binds to a nearby site, have no effect. Furthermore, occupancy of the crs element by a 
small polyamide is not sufficient to repress transcription, supporting the model that IE86- 
mediated repression results from steric interference with the recruitment of RNA polymerase 
II to the preinitiation complex. 

20 Figure 29 is a schematic representation of the sequence of the HCMV MIEP region 

from position -34 to +7. The TATA box and the repressor binding site (located at -14 to +1 
relative to the start site of transcription) are boxed. The polyamides are schematically 
represented at their respective DNA binding sites. The shaded and open circles represent 
imidazole and pyrrole rings, respectively, the hairpin junction is formed by y-aminobutyric 
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acid, and the diamond represents p-alanine. Polyamides 2 and 3 contain an additional amino 
group at the hairpin junction. The mismatch in polyamide 4 is boxed. Structures of five 
polyamides are shown schematically: ImPylmPy-y-PyPyPyPy-P-Dp (1), ImPyImPy-2,4D- 
PyPyPyPy-P-Dp (2), InvP-ImPy-2,4D-PyPyPyPy-p-Dp (3), ImPyPyPy-y-PyPyPyPy-p-Dp 
5 (4) and ImlmPyPy-y-ImPyPyPy-P-Dp (5), which binds the sequence 5'-AGGTCT-3' adjacent 
to the TATA box. Im = imidazole, Py = pyrrole, y = y-aminobutyric acid, P = p-alanine, 2,4D 
= 2,4 diaminobutyric acid, and Dp =dimethylaminopropylamide. 

The eight ring polyamide 1, ImPylmPy-y-PyPyPyPy-p-Dp, binds the six base pair 
10 sequence 5'-AGTGAA-3' within the IE86 binding site with an apparent dissociation constant 
of 1 .4 nM. The predicted structure of this complex is shown in Figure 29 (polyamide 1). Two 
additional polyamides were synthesized that bind the same site, but with higher affinities: 
polyamide ImPylmPy-D-PyPyPyPy-p-Dp (polyamide 2, where 2,4D represents 2,4 
diaminobutyric acid) and Im-p-ImPy-2,4D-PyPyPyPy-p-Dp (polyamide 3) bind the sequence 
15 5'-AGTGAA-3' with K<]S of 1 nM and 0.25 nM, respectively. A single atom substitution in 
this molecule, changing an imidazole to a pyrrole (nitrogen to C-H; polyamide 4, shown in 
Figure 29), reduces the affinity of this compound by -30 to 100-fold for its target site. A 
control polyamide, ImlmPyPy-y-ImPyPyPy-p-Dp (polyamide 5) binds the sequence 5'- 
AGGTCT-3' adjacent to the TATA box of the MIEP with a of 1 nM. 

20 

These compounds together with purified recombinant IE86 protein were used in DNase 
I footprint experiments with a singly end-labeled restriction fragment derived from the CMV 
major immediate early promoter. As predicted, polyamide 1 protects the sequence 5'- 
AGTGAA-3' in the center of the IE86 binding site. Approximately 50% protection is seen 

25 with 2 nM polyamide, and complete protection is obtained with 200 nM polyamide 1 (Figure 
30, lanes 3-5). Recombinant protein 1E86 protects a region extending between positions -24 
and +3. Approximately 250 nM IE86 is required to give complete protection (Figure 30, lanes 
6 and 14). When polyamide 1 is preincubated with DNA for 15 minutes prior to addition of 
protein, the IE86 footprint is partially replaced by the polyamide 1 specific footprint at a 

30 concentration of 2 nM polyamide, and is completely replaced at a concentration of 200 nM 
polyamide (Figure 30, lanes 7-9). The control polyamide 5, which protects a site 5'-AGGTCT- 
3' immediately upstream of the TATA box, does not interfere with IE86 binding in a 
concentration range of 2-200 nM. Both the IE86-specific footprint and the polyamide 5 
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footprint coexist, even though they are only 9 base-pairs apart (Figure 30, lanes 14-17). A 
singly-end labeled restriction fragment was used in DNase I footprinting reactions. Reactions 
contained approximately 250 nM his-tagged IE86 where indicated, and polyamide 1 in the 
concentrations indicated at the top of the lanes. Lanes 1 and 18 represent a G+A sequencing 
ladder. The extent of the footprints created by polyamide 1, IE86 and the control polyamide 5 
are indicated by brackets alongside the autoradiogram. The location of the TATA box and the 
start of transcription are indicated. 

The same experiment was repeated with polyamides 2 and 3, which bind the same site 
5'-AGTGAA-3' with higher affinities than polyamide 1. A complete protection is seen with 5 
and 10 nM polyamide 3 (Figure 31, lane 4), which are also the concentrations required to 
completely inhibit IE86 from binding (Figure 31, lanes 8 and 9). A partial inhibition of IE86 
binding is detected at 1 nM polyamide 3 (Figure 31, lane 7). A similar result was obtained 
with polyamide 2 (data not shown). The mismatch polyamide 4 did not give a footprint on the 
CMV fragment in the concentration range used in these experiments (1 - 200 nM), and it did 
not interfere with IE86 binding (Figure 31, lanes 11-13 and 15-17). The IE86 concentration 
was approximately 250 nM. The concentrations of the polyamide are indicated in nM at the top 
of the lanes. The extent of the footprints for polyamide 3 and IE86 protein as well as the TATA 
box and transcription start site are shown at the side of the autoradiogram. 

The effects of the polyamides on MIEP transcription were tested in an in vitro system 
consisting of a cell-free extract prepared from cultured human CEM cells. It is known fi-om 
previous studies that IE86 inhibits transcription by binding to the cis repression sequence, and 
physically blocks the access of RNA polymerase II to the preinitiation complex. Whether 
polyamide 1, which has a relatively low binding affinity (in about the same range as IE86), 
could counteract the negative effect of IE86 on transcription was tested. Run-off transcription 
reactions were performed with linearized plasmid and CEM extract. The concentration of IE86 
that was needed to give approximately 75% inhibition of transcription was empirically 
determined. The polyamides were added at a concentration of 1 jiM. 

Figure 32 shows that IE86 inhibits CMV transcription by approximately 75% (lane 2). 
When polyamide 1 is preincubated with DN A prior to addition of IE86 protein, the level of 
transcription is restored to approximately 75% of the control (Figure 32, lane 3). Interestingly, 



wo 98/35702 



PCT/US98/02444 



-64- 

the presence of polyamide 1 bound to the crs in the absence of IE86, has no effect on 
transcription, even at a concentration of 1 \xM (Figure 32, lane 4). An unrelated polyamide, 
which has no binding site in the CMV promoter, has no effect on IE86-mediated repression, 
even at a concentration of 1 jaM (Figure 32, lane 5). Thus, polyamide 1 is able to specifically 
5 counteract transcriptional repression mediated by IE86 protein. 

The foregoing is intended to be illustrative of the present invention, but not limiting. 
Numerous variations and modifications of the present invention may be effected without 
departing from the true spirit and scope of the invention. 

10 
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We claim: 

1 . A method of modulating the expression of a cellular or viral gene comprising the steps 
of: 

a. identifying a unique target DNA sequence adjacent to the binding site of at least one 
minor groove transcription factor protein; 

b. chosing a polyamide having a subnanomolar affinity for the unique target DNA 
sequence; and 

c. contacting the unique target DNA sequence with a transcription inhibiting amount of 

the polyamide. 

2. The method of claim 1 comprising the additional step of designing the polyamide to be 
specific for the unique target DNA sequence. 

3. The method of claim 1 wherein the minor groove transcription factor protein is chosen 
from the group consisting of TFIIIA, TBP, LEF-1, and Ets-1. 

4. A method for modulating expression of a cellular or viral gene controlled by a minor 
groove transcription factor protein using a specifc polyamide including paired 
carboxamide residues^ comprising the steps of: 

a. identifying a target sequence of double stranded DNA adjacent to the binding site 
of the minor groove transcription factor protein, the target sequence having the form 
5'-WNiN2 , . . NmW-3', wherein N]N2 ... Nm is the sequence to be bound by 
carboxamide residues, wherein each N is independently chosen from the group A, 
G, C, and T, each W is independently chosen from the group A and T, and m is an 
integer having a value from 3 to 6; 

b. selecting a specific polyamide that is selective for identified target DNA sequences 
5'-WNiN2 . . , NniW-3' and having the form 

X1X2 . . . Xnry-X(m + I) . . . X(2m-l)X2m 

c. wherein Xi, X2, Xni, X(m + 1), X(2m - 1). and X2m are carboxamide residues 
forming carboxamide binding pairs Xi/X2m, X2/X(2m-1)5 Xni/X(in + 1), and y is 
y-aminobuytic acid or 2,4 diaminobutyric acid and Dp is 
dimethylaminopropylamide, where m is an integer having a value from 3 to 6; and 

d. contacting the target sequence with a transcription inhibiting amount of the specific 
polyamide. 
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5. The method of claim 4 further comprising the step of designing the specific polyamide 
of the form 

X1X2 . . . Xm-Y-X(m + 1) . . . X(2m-l)X2m- 
comprising the steps of; 

a. representing the identified target sequence as S'-Wab , . . xW-3\ wherein a is a first 
nucleotide to be bound by the Xi carboxamide residue, 6 is a second nucleotide to 
be bound by the X2 carboxamide residue, and x is the corresponding nucleotide to 
be bound by the Xm carboxamide residue; 

b. defining a as A, G, C, or T to correspond to the first nucleotide to be bound by a 
carboxamide residue in the identified sequence; 

c. selecting Im as the Xi carboxamide residue and Py as the X2m carboxamide 
residue if a = G; 

selecting Py as the Xi carboxamide residue and Im as the X2m carboxamide 
residue if a = C; 

e. selecting Hp as the Xi carboxamide residue and Py as the X2m carboxamide 
residue if a = T; 

f. selecting Py as the X] carboxamide residue and Hp as the X2m carboxamide 
residue if a = A; and 

g. repeating steps c - g for 6 through x until all carboxamide residues are selected. 

6. The method of claim 4 further comprising the step of replacing at least one pyrrole 
residue with a P-alanine residue. 

7. A method of inhibiting the replication of a pathogen by administering a transcription 
inhibiting amount of at least one polyamide compound. 

8. The method of claim 7 wherein the pathogen is chosen from the group consisting of 
viruses, bacteria, fungi and protozoans. 

9. The method of claim 7 wherein the pathogen is a retrovirus. 

10. The method of claim 7 wherein the pathogen is HIV-1 . 

1 1 . The method of claim 7 wherein the polyamide is chosen from the group consisting of 
ImPyPyPy-y-lmPyPyPy-p-Dp, ImPy-p-ImPy-y-ImPy-P-ImPy-P-Dp, ImPy-P- 
PyPyPyPy-p-ImPy-p-Dp, ImPy-P-ImPy-p-PyPyPyPy-p-ImPy-P-ImPy-p-Dp, ImPy-p- 
ImPyPyPy-y-ImPyPyPy-P-PyPy-p-Dp, ImlmPyPy-y-ImPyPyPy-P-Dp, and mixtures 
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thereof wherein Im is N-methylimidazole, Py is N-methylpyrrole, y is y-aminobutyric 
acid, p is P-alanine, and Dp is dimethylaminopropylamide. 

12. A composition comprising; 

a. a transcription inhibiting amount of at least one polyamide chosen from the group 

consisting of ImPyPyPy-y-ImPyPyPy-p-Dp, ImPy-P-ImPy-y-ImPy-p-ImPy-P-Dp, 
ImPy-p-PyPyPyPy-p4mPy-p-Dp, ImPy-p-ImPy-P-PyPyPyPy-P-ImPy-P-ImPy-p- 
Dp, ImPy-P-ImPyPyPy-y-ImPyPyPy-P-PyPy-P-Dp, ImlmPyPy-y-ImPyPyPy-p-Dp, 
and mixtures thereof wherein Im is N-methylimidazole, Py is N-methylpyrrole, y is 
y-aminobutyric acid, p is P-alanine, and Dp is dimethylaminopropylamide; and 

b. a pharmaceutically acceptable excipient. 

13. The composition of claim 12 wherein the composition is suitable for parenteral use. 

14. A method of improving the binding affinity of a polyamide selective for an identified 
target DNA in a viral genome comprising the step of replacing a carboxamide binding 
pair that does not include N-methyl-imidazole carboxamide with a carboxamide 
binding pair consisting of P-alanine paired with P-alanine. 

15. A method of inhibiting the binding of the zinc fmger protein TFIIIA to the 5S 
ribosomal RNA gene internal control region, comprising: 

contacting the 5S ribosomal RNA gene internal control region with an inhibiting 
amount of a polyamide of the formula ImPyPyPy-y-ImPyPy-P-Dp, wherein Im is N- 
methylimidazole, Py is N-methylpyrrole, y is y-aminobutyric acid, P is p-alanine, and 
Dp is dimethylaminopropylamide, 

16. A composition comprising a transcription inhibiting amount of at least one polyamide 
chosen from the group consisting of ImPyPyPy-y-ImPyPy-p-Dp, ImPy-P-ImPy-y- 
ImPy-p-ImPy-P-Dp and mixtures thereof and a pharmaceutically acceptable excipient 
suitable for the treatment of HIV- 1 infection. 

17. A method of treating a human patient with an HIV-1 infection comprising the step of 
administering a composition comprising a transcription inhibiting amount of at least 
one polyamide chosen from the group consisting of ImPyPyPy-y-ImPyPy-P-Dp, ImPy- 
P-ImPy-y-ImPy-P-ImPy-P-Dp and mixtures thereof and a pharmaceutically acceptable 
excipient. 

18. A method of treating HIV-1 infected human blood cells in vitro comprising the step of 
administering a composition comprising a transcription inhibiting amount of at least 
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one polyamide chosen from the group consisting of ImPyPyPy-y-ImPyPy-P-Dp, ImPy- 
P-ImPy-y-ImPy-P-ImPy-P-Dp and mixtures thereof 

19. A composition comprising a transcription inhibiting amount of ImPy-(3-ImPy-Y-PyPy- 
P-PyPy-P-Dp and a pharmaceutically acceptable excipient suitable for the treatment of 

5 a cancer charactertized by the over-expression of her-2/neu. 

20. A method of treatment of a adenocarcinoma of the ovary, endometrium, breast, 
fallopian tube and cervix comprising administering the composition of claim 19. 
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